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IDENTIFICATION OF OVARIAN CANCER TUMOR MARKERS 
AND THERAPEUTIC TARGETS 

PRIORITY CLAIM 

5 This application claims the benefit of U.S. Provisional Application No. 60/357,03 1, filed 

February 13, 2002, which is incorporated by reference in its entirety herein. 

FIELD OF THE DISCLOSURE 

The present disclosure is related to diagnosing, prognosing, staging, preventing, and treating 
10 disease, particularly ovarian cancer. 

BACKGROUND 

Ovarian cancer has one of the highest mortality rates of all cancers, due in part to the 
difficulty of diagnosis. Currently, epithelial ovarian cancer is the leading cause of death resulting 
15 from gynecological cancer (see Welshes/., PNAS9&: 1176-1181,2001). Studies indicate that the 
five-year survival rates for ovarian cancer are as follows: Stage I (93%), Stage n (70%), Stage m 
(37%), and Stage IV (25%) (see Holschneider & Berek, Semin. Surg. Oncol. 19: 3-10, 2000). Thus, 
there is a particular need for improved methods of early diagnosis, prognosis, and monitoring of 
ovarian cancer. 

20 Protein and mRNA levels, and changes in these levels, may be associated with specific types 

of cancer (and cancer progression). Such association is often specific to the type of cancer, meaning 
that what is overexpressed in one cancer may be under-expressed (or unchanged) in another. Thus, a 
collection or set of genes/proteins that are differentially regulated in a specific cancer may be 
indicative and specifically diagnostic of that type of cancer. 
25 .Molecular mechanisms involved in the onset ana progression of ovarian cancer reniain 

poorly understood. However, some mutations causing ovarian cancer have been identified. Between 
5% and 10% of all breast cancers are hereditary. The remaining 90% to 95% are classified as 
"sporadic," for which no genetic link to development has been identified. 

Breast cancer susceptibility genes BRCAl (GenBank Accession* U14680) and BRCA2 
(GenBank Accession #U43746) are tumor suppressor genes. Germ-line mutations of BRCAl and 
BRCAl are responsible for approximately 5-10% of all epithelial ovarian cancers (see Li and Karlan, 
Curr. Oncol. Rep. 3:27-32, 2001). Of inherited breast cancers, it is believed that inherited mutations 
in BRCAl or BRCA2 are responsible as many as 70% of all cases. 

Those with inherited mutations in BRCAl and BRCA2 have an approximately 63% lifetime 
35 risk of developing breast cancer, whereas the general female population has a 12% lifetime risk. The 
BRCAl and BRCA2 gene mutations are more often identified in breast cancer patients with poor 
prognostic factors, which are risk factors that predict for poorer treatment outcomes (eg., estrogen- 
receptor-negative tumors, higher growth rates, age less than 35 at onset of disease, breast cancer in 
both breasts). Development of disease in the opposite breast and ovarian cancer also appear to be 
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more common in breast cancer patients with BRCA1 or BRCA2 mutations. Hence, the presence of 
BRCA1 oxBRCA2 mutations may indicate a need for more aggressive therapeutic treatments. 

The alleles oiBRCAl and BRCA2 must be inactivated before tumor development occurs. 
BRCA1 and BRCA2 are believed to take part in a common pathway involved in maintenance of 
5 genomic integrity in cells; however, little is known about the specific molecular mechanisms 

involved in BRCA mutation associated (BRCA-linked) ovarian carcinogenesis. For example, it is not 
known whether BRCA1 and BRCA2 mutations affect common or unique molecular pathways in 
ovarian cancer, or if these pathways overlap with those involved in the formation of sporadic tumors. 
Both BRCA proteins have been implicated in important cellular functions, including embryonic 
10 development, DNA damage repair, and transcriptional regulation (see Scully and Livingston, Nature 
408:429432, 2000; Zheng et ai, Oncogene 19:6159-6175, 2000; Welcsh et ai, Trends. Genet 
16:69-74, 2000; and MacLachJan et ai, J. Biol. Chem. 275:2777-2785 , 2000). BRCA1 mdBRCA2 
have each been implicated in defective homologous recombination DNA repair (see Arvanitis et ai, 
International Journal of Molecular Medicine 10:55-63, 2002), and it is believed that each may be a 
15 positive regulator of homologous recombination, with BRCA2 potentially interacting with Rad5 1 , a 
central homologous recombination effector protein, and BRCA J regulating GADD45, a DNA damage 
response gene. 

Patients having cervical and endometrial cancer resulting in defects in homologous 
recombination pathways have been shown to respond favorably to radiotherapy (Arvantis et ai). 
20 Therefore, patients having ovarian cancer resulting from a defect in BRCA1 or BRCA2 may similarly 
benefit from radiotherapy treatment Hence, the ability to classify ovarian cancer patients into groups 
based upon the underlying mutation provides advantages in selecting potential courses of treatment, 
and in deciding whether to pursue a more aggressive course of treatment. 

In sum, there is a need to better understand patterns of gene expression that trigger ovarian 
25 cancer, as well as downstream genes that may serve as indicators of ovarian cancer progression or as 
potential tumor suppressors. 

BRIEF SUMMARY OF THE DISCLOSURE 

30 The present disclosure concerns a method of classifying an ovarian tumor as a BRCAl-like 

or BRCA2-like or non-BRCA-type tumor, by detenmning a pattern of expression in the ovarian 
tumor of a plurality of markers listed in Table 1, wherein the pattern of expression in the ovarian 
tumor is determined relative to a standard ovarian tissue. The pattern of expression of the markers in 
the ovarian tumor is then compared to the pattern of expression of the same markers in tissue from a 

35 known BRCAl-like or BRCA2-like or non-BRCA-type tumor. A similarity of the pattern of 
expression in the ovarian tumor to a pattern of expression of the comparison tissue of the known 
BRCAl-like tumor classifies the ovarian tumor as a BRCAl-like tumor; a similarity of the pattern of 
expression in the ovarian tumor to a pattern of expression of the known BRCA2-like tumor classifies 
the ovarian tumor as a BRCA2-like tumor; and a similarity of the pattern of expression in the ovarian 
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tumor to a pattern of expression of the known sporadic tumor classifies the ovarian tumor as a 
sporadic tumor. 

The patterns of expression are determined, for example, by determining a pattern of over- 
expression or under-expression of the plurality of markers in the ovarian tumor to over-expression or 
under-expression of the plurality of markers of the comparison tissue. Alternatively, a pattern of both 
over-expression and under-expression of the plurality of markers in the ovarian tumor is compared to 
over-expression and under-expression of the plurality of markers in the comparison tissue. 

It has also been discovered that ovarian tumors that do not contain a BRCA1 or BRCA2 
mutation may be BRCA-l-like or BRCA2-like in that the pattern of expression of the markers is 
similar to a tumor having a BRCA-1 or BRCA-2 mutation. Hence tumors that would otherwise be 
considered lt non-BRCA-type" can be classified as BRCA-l-like or BRCA-2-like, which can 
contribute to decisions about treatment and prognosis even in the absence of the mutation. 

Standard ovarian tissue serves as a baseline from which patterns of over expression and 
under expression can be determined. The "standard" ovarian tissue may be, for example, from an 
15 immortalized ovarian cell, ovarian tissue from a subject not having ovarian cancer, a subject not 
predisposed to developing ovarian cancer, or ovarian tissue from a subject from whom the ovarian 
tumor was obtained at an earlier point in time. It is also possible for the standard tissue to be tumor 
tissue taken from a patient at an earlier point in time, for example prior to treatment (for example 
prior to the admmistratioii of chemotherapy). However in most instances the "standard" tissue is 
"normal" non-tumor ovarian tissue, such as an immortalized ovarian cell line, for example an IOSE 
cell line. 

Many different approaches are described in this disclosure for determining the patterns of 
expression, and assessing similarities. In specific examples, the patterns of expression are patterns of 
logarithmic expression ratios, hierarchical clustering patterns, or multidimensional scaling patterns. 

25 The patterns maybe compared visually or statistically to arrive at conclusions regarding similarity of 
the patterns. For example, when a multi-dimensional scaling pattern is used to generate a three- 
dimensional representation of data clusters associated with BRCAl-like, BRCA2-like or non-BRCA- 
like tumors, the position of a data point obtained from the tumor specimen that is being analyzed can 
indicate whether the tumor specimen has a pattern of expression associated with one of these groups. 

30 If the data point from the tumor specimen is present within or closely associated with one of these 
clusters, it is assigned a classification the same as the cluster in which is it contained or with which it 
is associated. 

Another approach to comparing patterns of over expression and under expression is to 
assign different color intensities to standard normal deviation values of the logarithmic expression 
35 ratios. Similarities of color patterns can then be used to arrive at a qualitative assignment of a tumor 
specimen to a classification. In another approach, the logarithmic expression ratios of the plurality 
markers is compared using compound covariate predictor analysis. 

In particular examples discussed herein, a MGAl-Vke ovarian tumor is differentiated from 
a non-BRCA-like ovarian tumor by comparing relative logarithmic expression ratios of at least one 
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marker shown in Table 6. In more particular embodiments, the pattern of expression of all the 
markers in Table 6 (CD72, SLC25A11, LCN2, PSTPIP1, SIAHBP1, UBE1> WAS, IDH2 t and PCTK1) 
is compared to the pattern of expression of these same markers in the specimen undergoing 
classification. 

5 In another example, a BRCA2-hkc ovarian tumor is distinguished from a non-BRCA-like 

ovarian tumor by comparing relative logarithmic expression ratios of at least one marker shown in 
Table 7, and in some embodiments both of the markers (LOC51760 and LRPAPJ). In yet other 
examples BRCAJ- and BRCA2-Hke ovarian tumors are distinguished from non-MC4-like ovarian 
tumors by comparing relative logarithmic expression ratios of at least one marker shown in Table 8, 
10 for example PSTPIPJ, IDH2, and PCTK1, or all the markers in Table 8. In other examples, a 

BRCAM&e ovarian tumor is distinguished from a BRCA2-\ike ovarian tumor by comparing relative 
logarithmic expression ratios of at least one marker shown in Table 10, more than one marker shown 
in Table 10, or all the markers in Table 10. 

The disclosed methods also include selecting a treatment strategy based on classifying the 
15 ovarian tumor as BRCAl-Kkz, BRCA2-tike or non-MG4-like. For example, the treatment strategy 
may include selecting a more aggressive treatment regimen for a BRCAMikc or BRCA2~Kke tumor 
(even if the tumor does not contain a BRCA1 or BRCA2 mutation). Such treatment regimens can 
include chemotherapy, radiotherapy, or surgical removal of the tumor and/or surrounding tissue. 

In yet other disclosed examples, the expression patterns of a tumor specimen and known 
20 comparison tissue are compared using a database of patterns (for example a database of logarithmic 
expression patterns) associated with BRCAl-like, BRCA2-Kke or non-AKG4-like ovarian tumors. 
The database can contain, for example, expression ratios of the plurality of markers in standard tissue. 
The patterns of the expression ratios of the plurality of markers of the tumor specimen can then be 
compared to the pattern of expression ratios of the same markers in the standard tissue. 
25 In some examples, comparisons may be made just of patterns Of over expression, for one or 

more markers that is over expressed as listed in'Table 5. Alternatively, comparisons may be made 
just of patterns of under expression. The patterns of expression may be obtained by using nucleic 
acid sequences of the markers to perform nucleic acid hybridization of specific oligonucleotide 
probes to the nucleic acid sequences. The markers may be amplified prior to performing nucleic acid 
30 hybridization, and expression quantitated to detect a level of differential expression. The markers are 
conveniently provided on an array, such as a cDNA raicroarray. In one example the cDNA 
microarray contains at least 50, 100, 200, 400 or more of the markers listed in Table 1. 

The results of these comparisons can be used to diagnose or provide a prognosis of 
progression of ovarian cancer in a subject. The patterns of expression can also be used to screen for 
35 therapeutic agents for the treatment of ovarian cancer, or monitoring response to therapy in a subject, 
by looking for a return of the patterns of expression of the ovarian tumor toward a non-tumor tissue 
pattern. Kits are also provided for performing these analyses, and the kit may include arrays with 
cDNAs of the markers. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 shows the overall expression differences between BR CA /-like, BRCA2-\ike i and 
non-BRCA-tike ovarian epithelial cancers. Figure 1A. Multidimensional scaling model based on the 
overall gene expression (6,445 filtered spots, Example 1) in BRCA 1 -linked (solid circles), BRCA2- 
5 linked (open circles), and sporadic tumors (asterisks). FIG IB. The magnitude of differences in gene 
expression between various tumor groups as revealed by the number of genes differentially expressed 
among them using the uniform statistical cutoff PO.0001. 

Figure 2 illustrates that BRCA1- and £tf ^-discriminating genes also segregate sporadic 
ovarian cancers into two groups (BRCAl-tikR and BRCA2-\\ke). Figure 2A. Hierarchical clustering 
10 of 1 10 non-redundant genes (see Table 9, Addendum) showing significant differential expression 
between BRCAl-lmked and (Bl) and BRCA2-\wked (B2) tumors (modified F-test PO.0001). The 
red and green color intensities represent standard normal deviation (Z-score) values from the mean 
expression level of each gene (represented as black) across sixty-one tumor samples (Example I). 
Figure 2A> is a duplication of Figure 2A, but is printed in grey tones rather than in color. Figure 2B. 
15 Hierarchical clustering of sporadic and 2?i?G4-linked tumor samples based on the expression partem 
of the 1 10 ARG4-discrirrmiating patterns of gene expression. The B-, B2-, and C-labeled samples 
signify BRCAlAixiked, BRCA2-\wked, and sporadic tumors, respectively. Figure 2C. Hierarchical 
clustering of sporadic samples in the absence of BRCA-linked tumors reveals two major clusters 
corresponding to BRCAl-type and BRCA2~type patterns of gene expression. 
20 Figure 3 shows molecular profiles of sixty-one tumors as defined by the genes whose 

expression significantly differentiated BRCA1 and BRCA2 tumors (PO.0001) (see Example 1, and 
Table 9). The red and green color intensities represent expression levels shown as standard normal 
deviation (Z-score) values from the mean expression level of each gene (represented as black) across 
sixty-one tumor samples. The genes are numbered consecutively 1-61 in Figure 3 A, and 62-1 16 in 
25 Figure 3B. ' Figure 3A> and Figure 3B' are duplications of Figure 3A and Figure 3B, respectively, 
but are printed in grey tones rather than in color. Figure 3C shows the correlation of the designated 
rows to genes and SEQ ID NOs for the molecular profile in Figure 3A and Figure 3D shows the 
correlation of the designated rows to genes and SEQ ID NOs for the molecular profile in Figure 3B. 

Figure 4 shows gene expression differences between BRCA-linked and sporadic tumors. A 
30 modified F-test with a statistical significance level of PO.0001 was used to evaluate genes 
differentially expressed between tumor types. The red and green color intensities represent 
expression levels shown as standard normal deviation (Z-score) values from the mean expression 
level of each gene (represented as black) across all sixty-one tumor samples. Each gene name is 
followed by the corresponding LM.A.G.E. clone number spotted on the array. Figure 4A. Genes 
35 differentially expressed between i?/2G47-linked (B) and sporadic (C) samples. Genes located on 

Xpl 1 appear in red. Figure 4B. Examples of genes differentially expressed between BRCA2Amktd 
(B2) and sporadic (C) samples. Figure 4C. Examples of differentially expressed genes between the 
combined BRCA J- and BRCA2-liriked group (B and B2, respectively) and the sporadic (C) samples. 
FIG 4A-C 5 is a duplication of Figure 4A-C, but is printed in grey tones rather than in color. 
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Figure 4D BRCA 1 -linked tumors exhibit significantly higher expression levels (PO.001) of all six 
genes mapped to Xpl 1.23 compared to the sporadic cancers. Error bars reflect standard error. 

Figure 5 is a bar graph showing an evaluation of gene expression patterns common to 
BRCA-lwked and sporadic tumors. Figure 5A shows the expression of twenty-five genes that 
5 showed two-fold or greater down-regulation as compared to the IOSE reference cell line. Figure SB 
shows the expression of twenty-five genes that showed two-fold or greater up-regulation as compared 
to the IOSE reference cell line. Error bars reflect standard error. (FOS, HE4 and CD24) have been 
previously reported to be overexpressed in ovarian cancers. Several of the overexpressed genes that 
have been demonstrated to be interferon-responsive are presented in italics. The * symbol denotes 

1 0 immediate-early response genes. 

Figure 6 is a series of bar graphs illustrating semi-quantitative RT-PCR (sqRT-PCR) 
analysis of gene expression confirms the cDNA microarray data. Expression patterns of select genes 
were examined using sqRT-PCR in representative BRCAJAinkcd (bars 1-5), BRCA2-liriked (bars 6- 
10), and sporadic (bars 1 1-15) samples. The expression level of each gene in the tumor samples was 

15 compared to those of normal postmenopausal ovary (N) and the reference IOSE cells (R). All data 
has been normalized to (J-actin is presented as fold expression compared to the IOSE reference RNA. 
Figure 6 A shows results for genes HE4, RSG1, and CD74. Figure 6B shows results for genes 
ZFP36, TOP2A and HLA-DRB1. 



20 BRIEF DESCRIPTION OF THE TABLES 

TABLE 1 (see Addendum) lists 822 ovarian cancer-related nucleic acid molecules that 
show altered expression in ovarian cancer. The nucleic acids are identified by their SEQ ID NO, 
their gene name (if one has been assigned), the I.M.A.G.E Clone ID number associated with the 
nucleic acid sequence, the UniGene number (if one has been assigned), and a description of the gene 
25 (if known). Because more than one GenBank Accession Number is sometimes provided for a given ' 
nucleic acid molecule, the Table groups the SEQ ID NO assigned to each GenBank Accession 
Number with nucleic acid molecule. For example, the entry for BCKDHB in Table 1 provides SEQ 
ID NOs: 16-17 (represented by GenBank Accession number AA427739 and GenBank Accession 
number AA434304). Each of the 822 SEQ ID NOs are included in the attached sequence listing. 
30 TABLE 2 catalogs the clinicopathologic features of the tumor samples in a study of sixty- 

one cases of pathologically-confinned epithelial ovarian adenocarcinoma. 

TABLE 3 lists representative gene-specific primer sequences used to amplify RNA for 
analysis by semi-quantitative PCR. 

TABLE 4 (see Addendum) lists markers that were under-expressed in ovarian cancer in a 
35 comparison of ovarian epithelial cancer cells to normal postmenopausal ovarian tissue. 

TABLE 5 (see Addendum) lists markers that were ove-rexpressed in ovarian cancer in a 
comparison of ovarian epithelial cancer ceDs to normal postmenopausal ovarian tissue. 

TABLE 6 (see Addendum) lists markers that were differentially expressed between BRCA 7- 
linked and sporadic tumors in a comparison to reference immortalized ovarian epithelial cells. 
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TABLE 7 (see Addendum) lists markers that were differentially expressed between BRCA2- 
linked and sporadic tumors in a comparison to reference immortalized ovarian epithelial cells. 

TABLE 8 (see Addendum) lists markers that were differentially expressed between 
combined BRCAMwktd and 5*C42-linked versus sporadic rumors in a comparison to reference 
immortalized ovarian epithelial cells. 

TABLE 9 (see Addendum) lists markers that were differentially expressed between BRCA1- 
linked and ^C42-linked tumors in a comparison to reference immortalized ovarian epithelial cells. 

TABLE 10 (see Addendum) lists markers that can be used to segregate BRCAl-lxke from 
BRCA2MV& tumor types using compound covariate prediction analysis. 

TABLE 11 (see Addendum) lists the results of compound covariate predictor analysis for 
the sixty-one tumors disclosed herein, analyzed using the markers in Table 10. 



BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

The nucleic and amino acid sequences listed in the accompanying sequence listing are 
15 shown using standard letter abbreviations for nucleotide bases, and single letter code for amino acids, 
as defined in 37 C.F.R. § 1.822. Only one strand of each nucleic acid sequence is shown, but the 
complementary strand is understood as included by any reference to the displayed strand. In the 
accompanying sequence listing: 

20 SEQ ID NO: 1 is a 63-nucleotide synthetic primer containing a T7 RNA polymerase binding 

site. 

SEQ ID NOs: 2 and 3 are ACTB gene-specific primers used for amplification during semi- 
quantitative RT-PCR. 

SEQ ID NOs: 4 and 5 are HE4 gene-specific primers used for amplification during semi- 
25 quantitative RT-PCR. 

SEQ ID NOs: 6 and 7 are ZFP36 gene specific primers used for amplification during semi- 
quantitative RT-PCR. 

SEQ ID NOs: 8 and 9 are RGSJ gene specific primers used for amplification during semi- 
quantitative RT-PCR. 

30 SEQ ID NOs: 10 and 1 1 are CD74 gene specific primers used for amplification during semi- 

quantitative RT-PCR. 

SEQ ID NOs: 12 and 13 are TOP2A gene specific primers used for amplification during 
semi-quantitative RT-PCR. 

SEQ ID NOs: 14 and 15 are HLA-DRB1 gene specific primers used for amplification during 
35 semi-quantitative RT-PCR. 

SEQ ID NOs: 16 through 822 are ovarian cancer-related nucleic acid molecules that show 
altered expression in ovarian cancer. These nucleic acid molecules are listed in Table 1, and then- 
sequence information is provided in the attached sequence listing. 
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DETAILED DESCRIPTION 

/. Abbreviations 

cDNA: complementary DNA 

5 DNA: deoxyribonucleic acid 

ELISA: en2^e-linkediinmunosoibent assay 
EST: expressed sequence tag 

I.M.A.G.E.: Integrated Molecular Analysis of Genomes and their Expression 
Consortium 

10 IOSE: immortalized ovarian surface epithelial cell lines 

MDS: multidimensional scaling 

PCR: polymerase chain reaction 

RIA: radioimmunoassay 

RNA: ribonucleic acid 

1 5 RT-PCR: reverse transcriprion-polymerase chain reaction 

siRNA: small inhibitory RNA molecule 

sqRT-PCR: semi-quantitative reverse transcription-polymerase chain reaction 
STS: sequence-tagged site ^ 

20 II. Terms 

Unless otherwise noted, technical terms are used according to conventional usage. 
Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, 
published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al. (eds.), The 
Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182- 

25 9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk 
Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8). 

In accordance with the present disclosure, conventional molecular biology, microbiology, 
and recombinant DNA techniques within the skill of the art are used. Such techniques are folly 
explained in the literature (see, e.g., Sambrook et al, 1989. Molecular cloning, a laboratory manual. 

30 2 nd ed. Cold Spring Harbor Laboratory, Cold spring Harbor, New York; Glover, 1985, DNA Cloning: 
A practical approach, volumes I and II oligonucleotide synthesis, MRL Press, LTD., Oxford, U.K.; 
Hames and Higgins, 1985, Transcription and translation', Hames and Higgins, 1984, Animal Cell 
Qilture; Freshney, 1986, Immobilized Cells And Enzymes, IRL Press; and Perbal, A Practical Guide 
to Molecular Cloning, John Wiley & Sons, New York, 1988). 

35 In order to facilitate review of the various embodiments of the invention, the following 

explanations of specific terms are provided: 

Altered expression or differential expression refers to expression of a nucleic acid {e.g. y 
mRNA or protein) in a subject or biological sample from a subject that deviates from that expression 
in a subject or biological sample from a subject having normal (wild-type) characteristics for the 

40 biological condition associated with the nucleic acid. Normal expression can be found in a control, a 
standard for a population, etc. For instance, where the altered expression manifests as a diseased 
condition, such as growth of a tumor or neoplasia or onset of a cancer such as ovarian cancer, 
characteristics of normal expression might include an individual who is not suffering from the 
condition (e.g., a subject not displaying neoplasia growth or not having ovarian cancer), a population 
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standard of individuals believed not to be suffering from the disease, etc. For instance, certain altered 
expression (such as altered expression of a BRCA nucleic acid), can be described as being associated 
with the biological conditions of altered (e.g., over-expressed or under-expressed) nucleic acid 
expression and a tendency to develop gynecological cancer, such as ovarian cancer. Likewise, 
altered expression may be associated with a disease. The term "associated with" includes an 
increased risk of developing the disease. 

Controls or standards (e.g., a reference cell line, such as immortalized epithelial ovarian 
cells) for comparison to a sample (e.g., an ovarian cancer tumor), for the determination of altered 
expression, include samples believed to be normal for the studied characteristic, as well as laboratory 
values, even though possibly arbitrarily set, keeping in mind that such values may vary from 
laboratory to laboratory. Laboratory standards and values may be set based on a known or 
detennined population value and may be supplied in the format of a graph or table that permits easy 
comparison of measured, experimentally determined values. 

When used in reference to a nucleic acid, amplification includes techniques that increase 
15 the number of copies of a nucleic acid molecule in a sample or specimen. An example of 

amplification is the polymerase chain reaction, in which a biological sample collected from a subject 
is contacted with a pair of oligonucleotide primers, under conditions that allow for the hybridization 
of the primers to nucleic acid template in the sample. The primers are extended under suitable 
conditions, dissociated from the template, and then re-annealed, extended, and dissociated to amplify 
20 the number of copies of the nucleic acid. The product of in vitro amplification can be characterized 
by electrophoresis, restriction endonuclease cleavage patterns, oligonucleotide hybridization or 
ligation, and/or nucleic acid sequencing, using standard techniques. Other examples of in vifro 
amplification techniques include strand displacement amplification (see U.S. Patent No. 5,744,3 1 1); 
transcription-free isothermal amplification (see U.S. Patent No. 6,033,881); repair chain reaction 
25 amplification (see WO 90/01 069); ligase chain reaction amplification (see EP-A-320 308); gap filling 
ligase chain reaction amplification (see U.S. Patent No. 5,427,930); coupled ligase detection and PCR 
(see U.S. Patent No. 6,027,889); and NASBA™ RNA transcription-free amplification (see U.S. 
Patent No. 6,025,134). 

An array is an arrangement of molecules, particularly biological macromolecules (such as 
30 polypeptides or nucleic acids) or cell or tissue samples, in addressable locations on or in a substrate. 
The array may be regular (arranged in uniform rows and columns, for instance) or irregular. The 
number of addressable locations on the array can vary, for example from a few (such as three) to 
more than 50, 100, 200, 500, 1000, 10,000, or more. A microarray is an array that is miniaturized so 
as to require or be aided by microscopic examination for evaluation or analysis. A cDNA 
35 microarray is an array of multiple cDNA molecules, fixed in addressable locations, to which 
complementary nucleic acids in applied samples may hybridize (see Hegde et al, Biotechniques 
29(3): 548-562, 2000). cDNA microarrays of the disclosure provide for qualitative and quantitative 
analysis of gene expression of the molecules contained in the array. 
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Within an array, each arrayed sample (feature) is addressable, in that its location can be 
reliably and consistently determined within the at least two dimensions of the array. Thus, in ordered 
arrays the location of each sample is assigned to the sample at the time when it is applied to the array, 
and a key may be provided in order to correlate each location with the appropriate target or feature 
5 position. Often, ordered arrays are arranged in a symmetrical grid pattern, but samples could be 
arranged in other patterns (e.g., in radially distributed lines, spiral lines, or ordered clusters). 
Addressable arrays usually are computer readable, in that a computer can be programmed to correlate 
a particular address on the array with information about the sample at that position {e.g., expression 
data, including for instance signal intensity as well as the identity of the sample). In some examples 

10 of computer readable formats, the individual features in the array are arranged regularly, for instance 
in a Cartesian grid pattern, which can be correlated to address information by a computer. 

The sample application location on an array (the "feature") may assume many different 
shapes. Thus, though the term "spot" may be used herein, it refers generally to a localized placement 
of molecules or tissue or cells, and is not limited to a round or substantially round region. For 

15 instance, substantially square regions of application can be used with arrays encompassed herein, as 
can be regions that are, for example, substantially rectangular, triangular, oval, irregular, or another 
shape. Within a single array, feature shapes do not usually vary, though they will in some 
embodiments. 

In certain example arrays, one or more features will occur on the array a plurality of times 

20 (e.g., twice) to provide internal controls. 

A biological sample is any sample in which the presence of a protein and/or ongoing 
expression of a protein may be detected. Suitable biological samples include samples containing 
genomic DNA or RNA (including mRNA), obtained from body cells of a subject, such as but not 
limited to those present in peripheral blood, urine, saliva, cells obtained by pap smear, sera, tissue 

25 biopsy, surgical specimen, amniocentesis samples and autopsy material. 

A BRCA1 -like tumor is a tumor in which the gene expression pattern is substantially similar 
to the gene expression pattern in a tumor from a subject who has a mutation in BRCAJ. Similarly, a 
BRCA2-hke tumor is a tumor in which the gene expression pattern is substantially similar to the 
gene expression pattern in a tumor from a subject who has a mutation m BRCA2. As described 

30 herein, sporadic tumors may share gene expression patterns with BRCA-linked and or BRCA2-linked 
tumors. Hence, sporadic and other tumors (such as tumors for which no BRCA genetic test has been 
conducted) that have gene expression patterns similar to a BRCAl-Iinked tumor are "BRCAl-like" 
tumors. 

A cancer is a biological condition in which a malignant tumor or other neoplasm has 
35 undergone characteristic anaplasia with loss of differentiation, increased rate of growth, invasion of 
surrounding tissue, and/or which is capable of metastasis. 

The term cancer includes ovarian cancer, such as ovarian epithelial cancer, which originates 
in the ovaries and may manifest as epithelial tumors, germ cell tumors, or stromal tumors. Also 
. included are different stages of a single cancer, for instance both primary and recurrent ovarian 
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cancer, and cancer at any progressive stage, such as Stages I-IV. Ovarian cancer is considered a 

gynecological cancer. 

A subject may be classified into an ovarian cancer stage based upon evaluation of a 

biological sample from the subject for indices known in the art or disclosed herein as being indicative 
5 of that stage of ovarian cancer. For example, a subject may be classified as having a cancer state of 

cancer-free, active ovarian cancer (i.e., stage I, II, HI, or IV ovarian cancer), or in remission from 

previous ovarian cancer. 

cDNA is a piece of DNA lacking internal, non-coding segments (introns) and regulatory 

sequences that determine transcription. cDNA is generally synthesized in the laboratory by reverse 
1 0 transcription from messenger RNA extracted from cells. 

Compound covariate prediction analysis is a method of predicting into which of two 

groups a sample will be assinged using a given statistical signficance cutoff (e.g., PO.0005). The 

method creates a multivariate predictor for one of two classes to each sample and includes in the 

multivariate predictor only those components (e.g., nucleic acids expressing on a cDNA microarray) 
15 that meet the statistical signficance cutoff. The multivariate predictor is a weighted linear 

combination of logarithmic ratios for components that are univariately significant The weight 

consists of the univariate t-statistics for comparing the classes. 

DNA is a polymer that comprises the genetic material of most living organisms (some 

viruses have genomes comprising RNA). The repeating units in most natural DNA polymers are four 
20 different nucleotides, each of which comprises one of the four bases, adenine, guanine, cytosine and 

thymine, bound to a deoxyribose sugar to which a phosphate group is attached. Triplets of 

nucleotides (referred to as codons) code for each amino acid in a polypeptide, or for a stop signal. 

The tenncodon is also used for the corresponding (and complementary) sequences of three 

nucleotides in the mRNA into which the DNA sequence is transcribed. 
25 Unless otherwise specified, any reference to a DNA molecule is intended to include the 

reverse complement of that DNA molecule. Except where single-strandedness is required by the text 

herein, DNA molecules, though written to depict only a single strand, encompass both strands of a 

double-stranded DNA molecule. Thus, a reference to the nucleic acid molecule that encodes a 

specific protein, or a fragment thereof, encompasses both the sense strand. and its reverse 
30 complement. Thus, for instance, it is appropriate to generate primers from the reverse complement 

sequence of the disclosed nucleic acid molecules. 

An expressed sequence tag (EST) is a unique stretch of DNA within a coding region of a 

gene that is useful for identifying full-length genes and serves as a land m ark for gene mapping. An 

EST is a sequence tagged site (STS) derived from cDNA. 
35 Expression of a gene is the process by which the coded information of a gene is converted 

into an operational or non-operational part of a cell, often including the synthesis of a protein. Gene 

expression can be influenced by external signals. For instance, exposure of a cell to a hormone may 

stimulate expression of a hormone-induced gene. Different types of cells may respond differently to 

an identical signal. 
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Expression of a gene also may be regulated in the pathway from DNA to RNA to protein. 
Ways in which regulation occurs include through controls acting on transcription, translation, RNA 
transport and processing, degradation of intermediary molecules such as mRNA, or through 
activation, inactivation or compartmentalization or degradation of specific protein molecules after 
5 they have been made. 

Changes in gene expression may be associated with specific types of cancer (and cancer 
progression). Such association is fairly specific to the type of cancer, and thus what is overexpressed 
in one cancer may be underexpressed (or unchanged) in another. 

The expression of several genes may be grouped into an expression pattern or expression 
10 profile. Such patterns or profiles may be unique to an individual sample depending upon certain 
factors, for instance biological stimuli introduced into the subject from which the sample was taken 
(e.g., a hormone) or ongoing disease within the subject (e.g., ovarian cancer). Thus, a coDecn'on or 
set of genes/proteins that are differentially regulated in a specific cancer may be indicative and 
specifically diagnostic of that type of cancer. In addition, specific expression patterns may indicate 
15 particular mutations within the individual that correlate and/or cause the disease, for instance a 
mutation in BRCA1 or BRCA2> or may indicate a larger class of disease, such as a BRCA1AM or 
BRCA2Aiks cancer. Furthermore, changing the expression patterns of these genes to restore the 
normal state, or bring the condition closer to the normal state in one or more characteristic, may 
constitute a treatment for cancer. 
20 As disclosed herein, the expression pattern of an unknown tumor may be compared to the 

expression pattern of known BRCAl-linked and BRCA2-linked markers to determine if the 
expression patterns are sufficiently similar to classify the unknown as a BRCA 1 -like or BRCA2-like 
tumor. 

Gene amplification or genomic amplification is an increase in the copy number of a gene 
25 or a fragment or region of a gene or associated 5' or 3' region, as compared to the copy number in 
normal tissue. An example of a genomic amplification is an increase in the copy number of an 
oncogene. A "gene deletion" is a deletion of one or more nucleic acids normally present in a gene 
sequence and, in extreme examples, can include deletions of entire genes or even portions of 
-chromosomes. 

30 A gene expression fingerprint (or profile) is a distinct or identifiable pattern of gene 

expression, for instance a pattern of high and low expression of a defined set of genes or gene- 
indicative nucleic acids such as ESTs; in some instances, as few as one or two genes may provide a 
profile, but often more genes are used in a profile, for instance at least three, at least 5, at least 1 0, at 
least 20, at least 25, or at least 50 or more. Gene expression fingerprints (also referred to as profiles) 

35 can be linked to a tissue or cell type, to a particular stage of normal tissue growth or disease 

progression, or to any other distinct or identifiable condition that influences gene expression in a 
predictable way. Gene expression fingerprints can include relative as well as absolute expression 
, levels of specific genes, and often are best viewed in the context of a test sample compared to a 
baseline or control sample fingerprint. By way of example, a gene expression profile may be read on 
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an array (e.g., a polynucleotide or polypeptide array). Arrays are now well known, and for instance 
gene expression arrays have been previously described in published PCT application number 
W09948916 ("Hypoxia-Inducible Human Genes, Proteins, and Uses Thereof), incorporated herein 
by reference in its entirety. 
5 As disclosed herein, the gene expression profile of an unknown tumor may be compared for 

similarities and differences to the expression profile of a tumor known to express in a BRCA-like 
manner (e.g., a BRCAl-like or BRCA2-like tumor). 

A genomic target sequence is a sequence of nucleotides located in a particular region in the 
human genome that corresponds to one or more specific genetic abnormalities, such as a nucleotide 

10 polymorphism, a deletion, or amplification. The target can be for instance a coding sequence; it can 
also be the non-coding strand that corresponds to a coding sequence. 

Gynecological cancers are cancers of the female reproductive system, and include cancers 
of the uterus (e.g., endometrial carcinoma), cervix (e.g., cervical carcinoma), ovaries {e.g., ovarian 
carcinoma, serous cystadenocarcinoma, mucinous cystadenocarcinoma, endometrioid tumors, 

15 celioblastoma, clear cell carcinoma, unclassified carcinoma, granulosa-thecal cell tumors, Sertoli- 
Leydig cell tumors, dysgerminoma, malignant teratoma), vulva (eg., squamous cell carcinoma, 
intraepithelial carcinoma, adenocarcinoma, fibrosarcoma, melanoma), vagina (e.g., clear cell 
carcinoma, squamous cell carcinoma, botryoid sarcoma), embryonal rhabdomyosarcoma, and 
fallopian tubes (e.g., carcinoma). 

20 An isolated biological component (such as a nucleic acid, peptide or protein) has been 

substantially separated, produced apart from, or purified away from other biological components in 
the cell of the organism in which the component naturally occurs, i.e., other chromosomal and 
extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and proteins that have been 
isolated thus include nucleic acids and proteins purified by standard purification methods. The term 

25 also embraces nucleic acids, peptides and proteins prepared by recombinant expression in a host cell 
as well as chemically synthesized nucleic acids. 

A marker is a diagnostic indicator of disease. A marker may consist of any signal 
indicating the presence of the disease, e.g., a physiological change in the body of a subject or 
increased or decreased levels of a substance such as a protein correlated to the disease. Markers are 

30 often found in body fluid samples from a subject By way of example, prostate specific antigen is a 
tumor marker used to detect progression of prostate cancer. The molecules disclosed herein, for 
instance in Table 1 are useful as tumor markers for diagnosing, prognosing, staging, preventing, and 
treating cancerous disease, such as ovarian cancer. 

A mutation includes any change of the DNA sequence within a gene or chromosome. In 

35 some instances, a mutation will alter a characteristic or trait (phenotype), but this is not always the 
case. Types of mutations include base substitution point mutations (e.g. , transitions or transversions), 
deletions, and insertions. Missense mutations are those that introduce a different amino acid into the 
sequence of the encoded protein; nonsense mutations are those that introduce a new stop codon. In 
the case of insertions or deletions, mutations can be in-frame (not changing the frame of the overall 
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sequence) or frame shift mutations, which may result in the misreading of a large number of codons 
(and often leads to abnormal termination of the encoded product due to the presence of a stop codon 
in the alternative frame). 

This term specifically encompasses variations that arise through somatic mutation, for 
5 instance those that are found only in disease cells, but not constitutionally, in a given individual. 

Examples of such somatically-acquired variations include the point mutations that frequently result in 
altered function of various genes that are involved in development of cancers. This term also 
encompasses DNA alterations that are present constitutionally, that alter the function of the encoded 
protein in a readily demonstrable manner, and that can be inherited by the children of an affected 
10 individual. In this respect, the term overlaps with "polymorphism," as defined below, but generally 
refers to the subset of constitutional alterations that have arisen within the past few generations in a 
kindred and that are not widely disseminated in a population group. In particular embodiments, the 
term is directed to those constitutional alterations that have major impact on the health of affected 
individuals, such as those resulting in onset of a disease such as a gynecological cancer. 
15 An oligonucleotide is a plurality of joined nucleotides joined by native phosphodiester bonds, 

between about 6 and about 300 nucleotides in length. An oligonucleotide analog refers to moieties that 
function similarly to oligonucleotides but have non-naturally occurring portions. For example, 
oligonucleotide analogs can contain non-naturally occurring portions, such as altered sugar moieties or 
inter-sugar linkages, such as a phosphorothioate oligodeoxynucleotide. Functional analogs of naturally 
20 occurring polynucleotides can bind to RNA or DNA, and include peptide nucleic acid (PNA) 
molecules. 

Particular oligonucleotides and oligonucleotide analogs can include linear sequences up to 
about 200 nucleotides in length, for example a sequence (such as DNA or RNA) that is at least 6 bases, 
for example at least 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100 or even 200 bases long, or from about 6 to 
25 about 50 bases, for example about 1 0-25 bases, such as 12, 1 5 or 20 bases. 

A neoplasm is a new and abnormal growth, particularly a new growth of tissue or cells in 
which the growth is uncontrolled and progressive. A rumor is an example of a neoplasm. 

Anon-BRCA-type tumor is a tumor in which the gene expression pattern of the BRCA1- 
linked and BRCA2-linked markers disclosed in Table 1 is not similar to either a BRCAMike or 
30 BRCA2-like gene expression pattern. 

A nucleic acid is a deoxyribonucleotide or ribonucleotide polymer in either single or double 
stranded form, and unless otherwise limited, encompasses known analogues of natural nucleotides 
that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. 

A nucleic acid sequence (or polynucleotide) is a DNA or RNA molecule, and includes 
35 polynucleotides encoding full-length proteins and/or fragments of such full length proteins which can 
function as a therapeutic agent. 

Nucleotide includes, but is not limited to, a monomer that includes a base linked to a sugar, 
such as a pyrimidine, purine or synthetic analogs thereof, or a base linked to an amino acid, as in a 
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peptide nucleic acid (PNA). A nucleotide is one monomer in a polynucleotide. A nucleotide sequence 
refers to the sequence of bases in a polynucleotide. 

A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the 
first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. 
5 For instance, a promoter is operably linked to a coding sequence if the promoter affects the 

transcription or expression of the coding sequence. Generally, operably linked DNA sequences are 
contiguous and, where necessary to join two protein-coding regions, in the same reading frame. 

An ovarian cancer-related molecule includes nucleic acids (such as DNA or RNA or cDNA) 
and proteins that are altered (for example by mutation or abnormal expression) in ovarian cancer. 
10 Pharmaceutical^ acceptable carriers include compositions and formulations suitable for 

pharmaceutical delivery of the nucleotides and proteins herein disclosed. Martin, Remington 's 
Pharmaceutical Sciences, published by Mack Publishing Co., Easton, PA, 19th Edition, 1995, 
describes conventional pharmaceutically acceptable carriers. 

In general, the nature of the carrier will depend on the particular mode of administration being 
1 5 employed. For instance, parenteral formulations usually comprise injectable fluids that include 

pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced 
salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., powder, 
pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, 
pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically- 
20 neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non- 
toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering 
agents and the like, for example sodium acetate or sorbitan monolaurate. 

Primers are short nucleic acids, preferably DNA oligonucleotides 10 nucleotides or more in 
length, which are annealed to a complementary target DNA strand by nucleic acid hybridization to 
25 form a hybrid between the primer and the target DNA strand, then extended along the target DNA " 
strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods 
known in the art. 

Primers as used in the present disclosure preferably comprise at least 10 nucleotides of the 
30 nucleic acid sequences that are shown to encode specific proteins. In order to enhance specificity, 

longer primers may also be employed, such as primers that comprise 15, 20, 30, 40, 50, 60, 70, 80, 90 
or 100 consecutive nucleotides of the disclosed nucleic acid sequences. Methods for preparing and 
using probes and primers are described in the references, for example Sambrook et al (1989) 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York; Ausubel et al (1987) 
35 Current Protocols in Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences; Innis et al 
(1990) PCR Protocols, A Guide to Methods and Applications, Innis et ah (Eds.), Academic Press, 
San Diego, CA. PCR primer pairs can be derived from a known sequence, for example, by using 
computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute 
for Biomedical Research, Cambridge, MA). 



WO 03/068054 



PCT/US03/04688 



16 

When referring to a primer, the term specific for (a target sequence) indicates that the primer 
hybridizes under stringent conditions substantially only to the target sequence in a given sample 
comprising the target sequence. 

A probe comprises an isolated nucleic acid attached to a detectable label or other reporter 
5 molecule. Typical labels include radioactive isotopes, enzyme substrates, co-factors, ligands, 

chermluininescent or fluorescent agents, haptens, and enzymes. Methods for labeling and guidance in 
the choice of labels appropriate for various purposes are discussed, e.g., Sambrook et al (In 
Molecular Cloning: A Laboratory Manual, CSHL, New York, 1989) and Ausubel et al (In Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 
10 A protein is a biological molecule expressed by a gene and comprised of amino acids. 

A purified molecule is one that has been purified relative to its original environment. The 
term "purified" does not require absolute purity; rather, it is intended as a relative terra Thus, for 
example, a purified protein preparation is one in which the protein referred to is more pure than the 
protein in its natural environment within a cell or within a production reaction chamber (as 
15 appropriate). Non-limiting examples of purified molecules are those that are 50%, 75%, or 90% 
pure. 

A recombinant nucleic acid is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two otherwise separated segments of sequence. This 
artificial combination is often accomplished by chemical synthesis or, more commonly, by the 

20 artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook et al, hi Molecular Cloning: A Laboratory Manual, CSHL, 
New York, 1 989. The term recombinant includes nucleic acids that have been altered solely by 
deletion of a portion of the nucleic acid. For instance, a plasmid is recombinant if some portion of 
the naturally occurring plasmid has been deleted. Equally, if the sequence of such a plasmid has been 

25 altered, for example by a nucleotide substitution (or addition or deletion), that plasmid is said to be 
recombinant. 

Sequence identity is the similarity between two nucleic acid sequences, or two amino acid 
sequences is expressed in terms of the similarity between the sequences, otherwise referred to as 
sequence identity. Sequence identity is frequently measured in terms of percentage identity (or 

30 similarity or homology); the higher the percentage, the more similar are the two sequences. Methods 
of alignment of sequences for comparison are well-known in the art. Various programs and 
alignment algorithms are described in: Smith and Waterman, J. Tlieor. Biol 91(2): 379-380, 1981; 
Needleman and Wunsch, J. Mol Bio. 48:443-453, 1970; Pearson and Lipman, Methods in Molec. 
Biology 24: 307-331, 1988; Higgins and Sharp, Gene 73:237-244, 1988; Higgins and Sharp, CABIOS 

35 5:151-153, 1989; Corpet et al, Nucleic Acids Research 16:10881-10890, 1988; Huang etal, 

Computer Applications in Biosciences 8: 155-165,1992; and Pearson et al., Meth. Mol Bio. 24: 307- 
331,1994. Altechul(?ra/.,Naf. Genet. 6(2): 119-129, 1994 presents a detailed consideration of 
sequence ahgnment methods and homology calculations. 
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The NCBI Basic Local Alignment Search Tool (BLAST) (see Altschul et al J, Mol Biol 2 15: 
403-410, 1990) is available from several sources, including the National Center for Biotechnology 
Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence 
analysis programs blastp, blastn, blastx, tblastn and tblastx. The Search Tool can be accessed at the 
5 NCBI website, together with a description of how to determine sequence identity using this program 
Nucleic acid sequences that do not show a high degree of identity can nevertheless encode 
similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that 
changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
molecules that all encode substantially the same protein. 
10 Serial analysis of gene expression (SAGE) is the use of short diagnostic sequence tags to 

allow the quantitative and simultaneous analysis of a large number of transcripts in tissue, as 
described in Velculescu et al, Science 270:484-487, 1995. 

A standard is a reference against which a value (e.g. 7 level of expression of a marker) can 
be compared. By way of example, a non-cancerous cell line may be used as a standard for comparing 
15 the level of expression of tumor markers in an ovarian tumor sample. Non-limiting examples of 

standards useful with the disclosed methods of analysis of patterns of expression of markers include a 
non-cancerous sample (e.g., normal ovarian tissue), a sample from a subject prior to development of a 
cancer or at an earlier stage of the cancer, and a cell line (e.g:, immortalized ovarian epithelial cells, 
such as IOSE cells) considered to display wild-type expression levels of the markers. In some 
20 embodiments, a reference RNA is arbitrarily chosen, but used consistently in relation to all tumor 
samples. 

A subject is a living multi-cellular vertebrate organisms, a category that includes both 
human and non-human mammals. 

A therapeutic agent, as used in a generic sense, is a composition used for treating a subject, 
25 such as a pharmaceutical or prophylactic agent 

A transformed cell is a cell into which has been introduced a nucleic acid molecule by 
molecular biology techniques. As used herein, the term transformation encompasses all techniques 
by which a nucleic acid molecule might be introduced into such a cell, including transfection with 
viral vectors, transformation with plasmid vectors, and introduction of naked DNA by 
30 electroporation, lipofection, and particle gun acceleration. 

Treating a disease includes inhibiting or preventing the partial or full development or 
progression of a disease (e.g., ovarian cancer), for example in a person who is known to have a 
predisposition to a disease. An example of a person with a known predisposition is someone having 
a history of breast or ovarian cancer in his or her family, or who has been exposed to factors that 
35 predispose the subject to a condition, such as exposure to radiation. Furthennore, treating a disease 
refers to a therapeutic intervention that ameliorates at least one sign or symptom of a disease or 
pathological condition, or interferes with a pathophysiological process, after the disease or 
pathological condition has begun to develop. 
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In some aspects, a more aggressive treatment may be selected if warranted. By way of 
example, if a subject is found to have a BRCAJ-Hke or BRCA2-Vke gene expression pattern, a more 
aggressive treatment, such as chemotherapy, radiotherapy, or surgical removal of the affected tissue 
and/or surrounding area may be selected. 
5 A tumor is an abnormal mass of tissue, or neoplasm that may be either malignant or non- 

malignant. "Tumors of the same tissue type" refers to primary tumors originating in a particular 
organ (such as breast, ovary, bladder or lung). Tumors of the same tissue type may be divided into 
tumor of different sub-types, for example ovarian carcinomas can be further classified based on 
tumor histology as 

10 adenocarcinoma, serous, endometrial, clear cell or mixed. Tumors may also be classified according 
to a genetic abnormality associated with the development of that type of tumor. By way of example, 
a tumor associated with a defect in tumor suppressor genes BRCAI or BRCA2 is referred to herein as 
a "BRCAI- or BRCA2-]mke&" tumor. As described herein, a sporadic ovarian tumor is a tumor 
arising for a reason other than a mutation in BRCAI or BRCA2, However, the similarities in the 

15 pattern of expression of ovarian cancer markers in sporadic tumors to those in BRCAlAmked and 
2?jRG42-linked tumors can be used to classify sporadic tumors into "BRCAI -like" or "BRCA2-likQ" 
tumors, using the methods of the disclosure. A "non-BRCA-type" tumor is one that has a pattern of 
expression of ovarian cancer markers unlike a BRCAI -like or BRCA2-like tumor. 

A vector is a nucleic acid molecule as introduced into a host cell, thereby producing a 

20 transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in the 
host cell, such as an origin of replication, and may also include one or more therapeutic genes and/or 
selectable marker genes and other genetic elements known in the art. A vector can transduce, 
transform, or infect a cell, thereby causing the cell to express nucleic acids and/or proteins other than 
those native to the celL A vector optionally includes materials to aid in achieving entry of the nucleic 

25 acid into the cell, such as a viral particle, liposome, protein coating or the like. 

Unless otherwise explained, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. 
The singular terms "a," "an," and "the" include plurals unless the context clearly indicates otherwise. 

30 Similarly, the word "or" is intended to include "and" unless the context clearly indicates otherwise. 
"Comprises" means "includes." It is further to be understood that all base sizes or amino acid sizes, 
and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. Although methods and materials similar or equivalent 
to those described herein can be used in the practice or testing of the present invention, suitable 

35 methods and materials are described below. All publications, patent applications, patents, and other 
references mentioned herein are incorporated by reference in their entirety. In case of conflict, the 
present specification, including explanations of terms, will control. In addition, the materials, 
methods, and examples are illustrative only and not intended to be limiting. 
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727. Description of Several Specific Embodiments 

Provided herein are methods of diagnosing or prognosing development or progression of 
ovarian cancer in a subject, which methods involve detecting altered expression of at least one marker 
5 (e.g., a nucleic acid molecule such as one listed in Table 1 or genes, cDNAs or other polynucleotide 
molecules comprising one of the listed sequences, or a fragment thereof, or a protein, such as one 
encoded by such a nucleic acid molecule, or fragment of such protein). In certain embodiments, 
altered expression is detected in more than marker, for instance in at least 50, at least 100, at least 
200, or at least 400 or more nucleic acid molecules listed in Table 1, or encoded for by a nucleic acid 
10 molecule listed in Table 1. In certain specific embodiments, no more than the molecules listed in 
Table 6, Table 7, Table 8, Table 9, Table 10 or Table 1 1 are included in such analysis. 

Additionally provided herein are methods for the classification of ovarian tumors as BRCAI- 
like, BRCA2-like or non-BRCA-like tumors based upon expression profiles of selected markers. 
Using the expression profile data, multiple types of comparisons can be made to provide qualitative 

15 and quantitative information about the tumor type. Non-limiting examples of such comparisons 
include visual exarnination of color profiles of hierarchically clustered markers on a cDNA 
microarray, multidimensional scaling to the detennine relative distance of the analyzed markers, and 
compound covariate prediction analysis to statistically classify a given tumor into one of two classes 
based upon the logarithmic expression ratio of the expression of at least one known classifying 

20 marker. In a specific non-lirniting example, logarithmic expression ratios are generated and used to 
classify tumor types by comparing to markers known to have a logarithmic expression ratio 
associated with BRCAI-lilte, BRCA2-\ike or non-BRCA-like tumors (see Example 4). 

Also encompassed herein are arrays containing two or more disclosed markers. Certain of 
such arrays are nucleic acid arrays that contain at least one marker, for instance at least one or more, 

25 such as 5, 10, 15, 25, 50, 100, 150, 200, 250, 300, 350, 400 or more nucleic acid molecules listed in 
Table 1 (or genes, cDNAs or other polynucleotide molecules comprising one of the listed sequences, 
or a fragment thereof), or a fragment of such protein, or an antibody specific to such a protein or 
protein fragment. Such arrays can also contain any particular subset of the nucleic acids (or 
corresponding molecules) listed in Tables 1-11 or all of those nucleic acids. Certain arrays (as well 

30 as the methods described herein) also may include nucleic acid molecules that are not listed in Table 
1. 

Certain of the encompassed methods involve measuring an amount of the ovarian cancer- 
related molecule in a sample (suchas a serum or tissue sample) derived or taken from the subject, in 
which a difference (for instance, an increase or a decrease) in level of the ovarian cancer-related 
35 molecule relative to a standard such as a sample derived or taken from the subject at an earlier time, 
is diagnostic or prognostic for development or progression of ovarian cancer. 

In some embodiments, altered expression of ovarian cancer-related nucleic acid molecules is 
detected using, for instance, in vitro nucleic acid amplification and/or nucleic acid hybridization. The 



WO 03/068054 



PCT/IIS03/04688 



20 



results of such detection methods can be quantified, for instance by detennining the amount of 
hybridization or the amount of amplification of the nucleic acid molecules. 

Specific embodiments of methods for detecting altered expression of at least one ovarian 
cancer-related molecule use the arrays disclosed herein. Such arrays may be nucleotide (e.g., 

5 polynucleotide or cDNA) or protein peptide, polypeptide, or antibody) arrays. In such methods, 
an array may be contacted with polynucleotides or polypeptides (respectively) from (or derived from) 
a sample from a subject. The amount and/or position of expression of the subject's polynucleotides 
or polypeptides then can be determined, for instance to produce a gene expression profile for that 
subject. Such gene expression profile can be compared to another gene expression profile, for 

10 instance a control gene expression profile from a subject having a known ovarian cancer-related 
condition. Similarly, protein arrays can give rise to protein expression profiles. Both protein and 
gene expression profiles can more generally be referred to as expression profiles. Expression profile 
data can be used to generate logarithmic expression ratios for use in compound covariate prediction 
analysis. 

15 Other embodiments are methods that involve providing nucleic acids from the subject; semi- 

quantitatively amplifying the nucleic acids to form nucleic acid amplification products using primers; 
quantifying the amount of the nucleic acid amplification products; and comparing results to 
expression levels obtained using cDNA microanalysis. The sequence of such primers may be 
selected to bind specifically to a nucleic acid molecule listed in Table 1, or a nucleic acid molecule 

20 represented by those listed in Table 1. In specific examples of such methods, the primers are selected 
to amplify a nucleic acid product encoding topoisomerase U (TOP2A) (SEQ ID NO: 448), regulator 
of G-protein signaling 1 (RGS1) (SEQ ID NO: 398), invariant gamma-chain-associated protein 
(CD74) (SEQ ID NO: 89-91), epi&dymis-specific, whey-acidic protein (HE4) (SEQ ID NO: 60), 
major histocompatibility complex, class II, DR beta 1 protein (HLA-DRB1) (SEQ ID NO: 87-88), or 

25 zinc finger protein (ZFP36) (SEQ ID NO: 167-168). 

Also encompassed are methods of ovarian cancer therapy, in which classification of a tumor 
of a patient into a BRCA /-like, BRCA2^ke or non-BRCA-like tumor type aids in the selection of a 
treatment regimen. In some examples, the treatment selected is specific and tailored for the subject, 
based on the analysis of that subjects profile for one or more ovarian cancer-related molecules. 

30 Other embodiments are kits for classifying tumors into a BRCA!-\ike, BRCA2A&R or non- 

BRCA-like tumor class, which kits may include a binding molecule that selectively binds to the 
marker that is the target of the kit. In some examples of such kits where the marker is an ovarian 
cancer-related protein, the binding molecule provided in the kit may be an antibody or antibody 
fragment that selectively binds to the target ovarian marker protein. In other examples of such kits 

35 where the ovarian cancer-related marker level is a nucleic acid, the binding molecule provided in the 
kit may be an oligonucleotide capable of hybridizing to the nucleic acid marker molecule. 

Further embodiments are methods of screening for a compound useful in treating, reducing, 
or preventing ovarian cancer or development or progression of ovarian cancer. Such methods involve 
determining if a test compound alters the gene expression profile of a subject (or ceDs of an in vitro 
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assay) so that the profile more closely resembles a wild-type expression profile than it did prior to 
such treatment, and selecting a compound that so alters the gene expression profile. In specific 
examples of such methods, the test compound is applied to a test cell. In some of such methods, the 
profile is determined or measured in an array format. 
5 Also encompassed are compounds selected using the methods described herein, which are 

useful in treating, reducing, or preventing ovarian cancer or development or progression of ovarian 
cancer. 

Also disclosed herein are uses of identified target ovarian cancer-related molecules for the 
development of antibodies, including therapeutic antibodies that affect an ovarian cancer-related 
10 pathway. It is also envisioned that the disclosed ovarian cancer-related molecules can be used as 
vaccines, for instance as "cancer vaccines" to elicit an immune response from a subject that renders 
the subject more resistant to developing or progressing through a stage of ovarian cancer. 

IV. Gene expression profiling of ovarian cancer tumor tissue using disclosed markers 
15 The present disclosure concerns gene expression profiling of ovarian tumor tissue from a 

subject for use in diagnosing, prognosing, staging, preventing, and treating the disease. Measurement 
of expression of genes within a tissue sample provides information regarding proteins that may be 
active during cancer mechanisms. Hence, the gene expression profile of tumor tissue may be 
compared against the profile for known markers for ovarian cancer, such as those disclosed herein 
20 (see Table 1). 

Using the gene expression profile, an ovarian tumor from a subject may be classified into a 
BRCAl-like, BRCA2-\\ke i or non-##G4-like tumor. Because the prognosis for a patient having a 
BRCA1 or BRCA2 mutation is poorer than for patient having non-BRCA-like mutations, 
classification of tumors into these groups is helpful in selecting treatment strategies and aids a 

25 clinician in deciding whether to employ a more aggressive regimen in treating the patient, for 
instance radiotherapy, chemotherapy, or surgical removal of the affected tissue. In addition, 
classification of a sporadic tumor into ai?i*G4i-like or BRCA2-\ike classification may provide 
similar guidance in treating the patient. For example, a subject who has a 2?^CM/-like or BRCA2-like 
sporadic tumor may be treated similarly to a subject who has a BRCAJAwked or 2?/?G42-linked 

30 tumor. The identification of BRCA1- or BRCA2-\ikt sporadic tumors also allows tumors (or subjects) 
to be selected for specific drug regimens that are particularly effective with the associated mutation 
type. 

The ovarian cancer-linked markers disclosed herein are believed to be useful as diagnostic 
or prognostic indicators of 5flC4/-like, 5i?G42-like and non-2?i?G4-like ovarian cancer. In addition, 
35 the markers are believed to be useful in applications for treating ovarian cancer as the basis of new 
therapeutic targets, for the development of new anti-cancer therapeutic compounds, and/or to select 
particularly appropriate existing treatments. For example, the expression levels of these markers can 
be examined to monitor the effectiveness of anti-cancer treatments where an increase in or decreased 
level of nucleic acid expression opposite of the ovarian cancer-indicative pattern disclosed herein 
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indicates an effective anti-cancer treatment Beyond use in generating an expression profile, certain 
of the identified genes or EST sequences provided herein are believed to have individual use as 
cancer markers. 

5 A, Generating gene expression information and logarithmic expression ratios, 

cDNA microanalysis allows for simultaneous analysis of the expression of multiple genes 
within various tissue samples, and is therefore useful in generating gene expression profiles. To 
perform cDNA microarray analysis, RNA is isolated from a subject and cDNA is synthesized from 
the RNA according to standard methods (see Sambrook et al, Molecular cloning, a laboratory 

10 manual 2 nd ed. Cold Spring Harbor Laboratory, Cold spring Harbor, New York, 1989). Relative 
over-expression of the mRNA in the cancerous tissues can be measured against non-cancerous 
reference baselines (e.g., ovarian tissue from a subject not having ovarian cancer or an ovarian cell 
line, such as an immortalized ovarian cell line), to provide a framework for detennining normal 
expression versus altered expression (genes that are either overexpressed or underexpressed). 

15 Nucleic acids that are overexpressed may be used as markers for ovarian cancer, while genes that are 
underexpressed maybe putative tumor suppressors. 

cDNA microarrays containing 7,651 sequence- verified features were constructed and 
applied to analyze the mRNA expression profile of sixty-one subjects with pathologically-confirmed 
epithelial ovarian adenocarcinoma having matched clinicopathologic features (see Alizadeh et ah, 

20 Nature 403: 503-51 1, 2000; Perou et al, Nature 406: 747-752, 2000; Bubendorf et al, J Natl. Cancer 
Inst, 91(20): 1758-64, 1999; Welsh et al, Proc Natl Acad Sci. USA 98: 1176-1181,2001). These 
included eighteen cases linked to BRCA1 founder mutations, sixteen cases linked to BRCA2 founder 
mutations, and 27 cases negative for any founder mutations (e.g., sporadic ovarian epithelial cancer). 
These samples were compared to expression levels of these same features in an immortalized normal 

25 ovarian surface epithelium cell line (IOSE). Statistical tools, including a modified F-test with ' 

P<0.0001 considered significant (e.g., a 99.99% confidence level) were then used to analyze the data 
(e.g., to differentiate gene expression profiles associated with ovarian cancer), enabling a 
comprehensive, genomics-based analysis of the mRNA expression profiles of these ovarian cancer 
subjects. 

30 The logarithmic expression ratios for the spots on each array were normalized by subtracting 

the median log ratio for the same array. Data were filtered to exclude spots with size less than 25 
(am, intensity less than two times background or less than 300 units in both red and green channels, 
and any flagged or missing spots. In addition, any features found to be missing or flagged in greater 
than 10% of the arrays were not included in the analysis. Application of these filters resulted in the 

35 inclusion of 6,445 of the total 7,651 features in subsequent analyses. Statistical comparison between 
tumors groups was performed using the "BRB Array Tools" software (developed by Dr. Richard 
Simon and Amy Peng, Biometrics Research Branch, Division of Cancer Treatment and Diagnosis, 
NCI, USA), consisting of a modified F-test with P < 0.000 1 (99.99% confidence level) considered 
significant (see Example 4). This stringent P value is selected in lieu of the Bonferroni correction for 
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multiple comparisons, which was deemed excessively restrictive (see Bland and Altaian, BMJ.\ 310: 
170, 1995). See Example 4 for an example of how an ovarian tumor is analyzed using the disclosed 
methods. 

In addition to statistical analysis, multidimensional scaling (MDS) and hierarchical 
5 clustering techniques using a correlation metric and average linkage were used for evaluating overall 
gene expression. Using these techniques, a large set of genes and other encoding sequences (e.g., 
expressed sequence tags, ESTs) have been identified (Table 1), the expression of which varies in 
subjects having ovarian cancer (see Addendum). Other confidence levels could be used to select 
ovarian cancer-related molecules, such as 98%, 95%, 90%, 85%, and so forth (see Jain et a/., IEEE 

1 0 Transacations on Pattern Analysis and Machine Intelligence 22(1): 4-37, 2000). Molecules 

identified as being linked to ovarian cancer (referred to generally herein as ovarian cancer-related 
molecules) using the methods described herein can be arranged on arrays for use in diagnostic and 
prognostic methods. Specific arrays are contemplated that are constructed using molecules identified 
at differing confidence levels. Specific examples of such arrays include arrays that detect altered 

1 5 expression of at least 2, 5, 1 0, 20, 30, or 50 of these molecules. 

B. Comparison of ovarian epithelial adenocarcinoma cells to immortalized ovarian 
surface epithelium cells. 

In a comparison of ovarian epithelial adenocarcinoma cells to immortalized ovarian surface 

20 epithelium cells, the largest contrast in gene expression was observed between BRCA1- and BRCA2- 
linked tumors, with multiple genes showing significant differences in expression levels. This group 
of genes was also able to segregate the sporadic tumors into two major "BRCAJ-like" and "BRCA2- 
like" subgroups, indicating that57?C4-related pathways are also involved in sporadic ovarian 
cancers. In addition, two previously unreported gene expression patterns were noted. First, six of the 

25 genes differentially expressed between £££4 /-linked arid sporadic tumors'map to Xpl 1.23 and all 
exhibited higher mean expression levels in the MG4/-linked samples [WAS (SEQ ID NO: 524-526), 
EBP (SEQ ID NO: 529), SMC1L1 (SEQ ID NO: 529), PCTK1 (SEQ ID NO: 527-528), ARAF1 (SEQ 
ID NO: 531-532), and UBE1 (SEQ ID NO: 533), see Figure 3]. Second, compared to immortalized 
ovarian surface epithelium cells, several interferon-inducible genes were noted to be overexpressed in 

30 the majority of all tumor samples [SJAT1 (SEQ ID NO: 73), TNFSF10 (SEQ ID NO: 104-106), 

ABCB1 (SEQ ID NO: 164-166), CP (SEQ ID NO: 83-84), HLA-DRB5 (SEQ ID NO: 85-86), HLA- 
DRB1 (SEQ ED NOS: 87-88, 100, 101-103), CD74 (SEQ ID NO: 92-93), HLA-DRA (SEQ ID NO: 
94-96), HLA-DPA (SEQ ID NO: 97-99), IFTTM1 (SEQ ID NOS: 50-51, 52-54), IFJTM2 (55-57, 58- 
59),i42Af(SEQIDNO: 193-195), GJP3 (68-69), IGKC (SEQ ID NOS: 112-114, 115-116), SCYB10 

35 (SEQ ID NO: 120-121), Col3Al (SEQ ID NO: 141-143), HLA-B (SEQ ID NO: 154-156), and HLA-C 
(SEQ ID NO: 157-159), see Figure 4]. In terms of overall differential gene expression, BRCAJ and 
BRCA2-]mked tumors express genes more different from each other than from sporadic (non-BRCA- 
linked) tumor samples. 
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The identified ovarian cancer-related genes represent putative mediators of ovarian cancer, 
and as such are candidate targets for the development of novel therapeutics for the treatment of 
ovarian cancer using conventional techniques. By way of example, a candidate drug, targeted at 
restoring expression of a gene of the disclosure, could be examined using cDNA microarray analysis 
5 for utility in influencing growth of ovarian cancer cells. Thus, use of cDNA microarray techniques 
for genomics-based discovery of genes variably expressed during ovarian cancer provides for the 
identification of novel therapeutic targets for treatment of ovarian cancer. 

It is contemplated that certain of the ovarian cancer markers identified herein encode or 
correspond to soluble proteins, while others encode or correspond to membrane associated or 

10 membrane integral proteins, some of which are exposed at least to a certain extent on the exterior of a 
cell in which they are expressed. In some embodiments, those ovarian cancer-related molecules that 
are expressed at or on the surface of a cell are selected as therapeutic targets, for instance for 
targeting with an antibody-based therapy, which is facilitated by the access of the ovarian cancer- 
related molecule to the extracellular matrix These ovarian cancer markers may be described as being 

15 "drug accessible." In addition, such soluble ovarian cancer markers, if secreted, may be detected in a 
blood or serum sample from the subject. 

C. Comparison of ovarian epithelial cancer cells to normal postmenopausal ovarian 

samples. 

20 cDNA microarrays containing 7,600 sequence-verified features were constructed and 

applied to analyze the mRNA expression profile of 61 subjects with ovarian epithelial cancer as 
compared to two normal postmenopausal ovarian samples. 

Gene expression in each sample (normal or tumor) was directly compared to a "reference 
RNA" consisting of a mix of nine different human cell lines (breast adenocarcinoma, hepatoblastoma, 

25 cervical adenocarcinoma, testicular embryonal carcinoma, glioblastoma, melanoma, liposarcorha, 
histiocytic lymphoma, T cell lymphoblastic leukemia, and plasmacytoma/myeloma, Stratagene, La 
Jolla, CA). The raw gene expression data was used to calculate the logarithmic expression ratio for 
each gene. The logarithmic expression ratios ("log ratio") obtained from this comparison were then 
normalized and statistically compared to one another, providing for indirect comparison of gene 

30 expression in tumors and normal ovarian samples. This was accomplished by scoring the magnitude 
of differential expression of each gene (between normal and cancer samples) according to the 
formula: 

f average cancer log ratio - average normal log ratio) » magnitude of differential 
35 (standard deviation cancer + standard deviation normal) expression 

In genes showing a large mean expression difference between normal and cancerous samples, the 
magnitude of differential expression has a greater value, while the intra-group variability in 
expression ratios is low. 
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Genes were then ranked according to the magnitude of differential expression and the 
highest-ranking genes were considered to be the best candidates for differentiating normal from 
malignant ovarian samples (see Furey et al y Bioinformatics 16(10): 906-14, 2000). 

Using these techniques, a large set of genes and other encoding sequences (e.g., ESTs) that 
5 are under-expressed in subjects having ovarian cancer have been identified (see Table 4). These 
under-expressed ovarian cancer markers represent putative turnor suppressors, and as such are 
candidate targets for the development of novel therapeutics for the treatment of ovarian cancer using 
conventional techniques. By way of example, induction of expression of one or more of these 
markers through therapeutic means (e.g., induction by a drug or gene therapy) may inhibit tumor 
1 0 growth and/or increase tumor cell death, for instance through stimulation of apoptotic pathways. 

Furthermore, a large set of genes and other encoding sequences (e.g., ESTs) have been 
identified (see Table 5), the expression of which is overexpressed in subjects having ovarian cancer. 
These overexpressed ovarian cancer markers represent putative mediators of ovarian cancer, and as 
such are candidate targets for the development of novel therapeutics for the treatment of ovarian 
15 cancer using conventional techniques. Over-expression of one or more such markers can also be 
detected in the body (for example using a serum test to detect or monitor progression of ovarian 
cancer). 

In addition, six of the markers identified herein (eg., WAS (SEQ ID NO: 524-526), PCTK1 
(SEQ ID NO: 527-528), UBEJ (SEQ ID NO: 533), SMC1L1 (SEQ ID NO: 529), ARAF1 (SEQ ID 
20 NO: 53 1-532), and EBP (SEQ ID NO: 529)) have all been mapped to chromosome Xpl 1 (see 

Example 1). Hence, this chromosome could contain additional genes and ESTs that may be useful as 
markers for prognosing, diagnosing and monitoring ovarian cancer. The methods of the disclosure 
can be used to find additional genes and ESTs in this region for use as ovarian cancer markers. 



25 V. Methods of Classifying Tumors into Subgroups 

Disclosed herein are multiple methods of classifying tumors into subtypes based upon the 
expression of disclosed ovarian tumor markers (see Table 1). 



A. Comparison of raw expression data 

30 The expression data of one or more ovarian cancer markers can be compared between 

samples and analyzed to detect differences in expression between the markers. The expression of an 
individual marker can be stated in ratio or "fold" form relative to the expression of the standard. For 
instance, in Table 4, the average logarithmic ratio of the gene expression for the standard ("normar) 
for ITM2A (SEQ ID NO: 202) is 1.145, while the average logarithmic ratio of the gene expression in 

35 cancer cells was -2.036. These numbers can be compared to derive a value for the difference in 

expression by calculating the expression ratio of each number, and dividing the expression ratio for 
the average log cancer value by the expression ratio for the average log normal value. Hence: 



Expression ratio of average log normal: 2 1145s =2.21 1 
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Expression ratio of average log cancer 2" 2 03<s = 0.244 
Ratio (cancer to normal) - (0.244)/(2.21 1) - 0. 1 10 

5 Thus, ITM2A is under-expressed in cancer by a ratio of 0.1 10 to 1 (Le., in ovarian cancer tissue, 
ITM2A expresses at approximately 10% of the expression level seen in wild-type cells). 

Collections of such data can be assembled to provide a gene expression profile, as discussed 
above. With such profiles, the standard deviation of the expression ratio of each gene can be 
1 0 measured by obtaining the square root of the variance of the expression data as described by Jaccard 
and Becker (in Statistics for the Behavioral Sciences, 2 nd ed., Wadsworth Publishing Co., Belmont, 
California, 1990) and Myers and Well, (in Research Design and Statistical Analysis, University of 
Massachusetts, Amherst, Massachusetts, 1995). 

Further analysis can include a Student's t-test, to determine if the mean expression of two 
15 groups (e.g., BRCAMikc and non-^G4-like, BRCA2-Vkt and non-tftfC/Mike, etc.) are statistically 
different from each other. 

Due to the range over which genes may express, it may be useful to perform statistical 
analyses using the logarithmic expression value for each marker (see Example 1). However, 
calculations using the logarithmic expression values may dilute the ability of certain analyses to 
20 determine differences. Hence, it may be useful to employ multiple methods of analysis to ascertain 
relative values in expression (see Jain et aL, IEEE Transacations on Pattern Analysis and Machine 
Intelligence 22(1): 4-37, 2000). 

B. Visual analysis of hierarchical clustering 

25 Methods disclosed herein include hierarchical clustering analysis of genes with statistically 

significant differential expression between sets of tumor groups. Hierarchical clustering can be used 
to cluster objects (e.g., genes, such as the ovarian cancer markers listed in Table 1) to represent 
relationships among the objects. The relationships are represented, for example by a tree whose 
branch lengths reflect the degree of similarity between the objects (see e.g., Figure 2B). 

30 Optionally, hierarchical clustering can be combined with a graphical representation of the 

primary data by representing each data point with a color that quantitatively and qualitatively reflects 
the original experimental observations. The use of color representations, along with statistical 
organization, provides a graphical display that provides visual information about expression of the 
genes. Hence, the methods disclosed herein can provide visual information regarding degrees of 

35 similarity {e.g. y patterns of under-expression or over-expression) between assessed genes in different 
samples, for instance in samples of BRCA1 -linked, BRCA2-lwkcd and sporadic ovarian tumor 
samples (see Figure 2B). 

At the first iteration, each object is considered to be its own group, and the pair of objects 
with the smallest distance between them is merged into a new group. Each subsequent iteration 
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merges two groups to form a new group, until finally all objects end up merged into a single group. 
The classification tree, or dendrogram, graphically represents the sequence of clusters formed at each 
iteration of merges, as well as the distance between clusters at each merge (here, Figure 2). This 
technique is widely employed to represent gene expression information obtained from microarray 
5 experiments (see Eisen et al, Proc. Natl Acad. Sci. U.S.A. 95(25): 14863-8, 1998). 

The gene expression data disclosed herein were analyzed by calculating the Pearson 
correlation coefficient to obtain a gene expression similarity metric. To describe, Gi is (log- 
transformed) primary gene expression data for gene G in each tumor sample, represented as variable 
L For any two genes X and Y observed over a series of N tumor samples, a similarity score can be 
10 computed as follows: 

S^^-N, ^ [ * x j [ * y J 

where 



When Goffsa is set to the mean of the gene expression levels of the tumor samples for gene G, then 0 G 
becomes the standard deviation of G, and S(X, Y) is exactly equal to the Pearson correlation 
1 5 coefficient of the gene expression levels for genes X and Y. Values of G<# set that are not the average of 
the gene expression levels for gene G are used when there is an assumed unchanged or reference state 
(e.g., the gene is not over-expressed or under-expressed) represented by the value of G ojrset , against 
which changes are to be analyzed; in all of the examples presented here, G Qffsel is set to 0, 
corresponding to a fluorescence ratio of 1.0. 

20 By way of example, Figures 2A and 2A* demonstrate that expression of the disclosed 

markers can be used to visualize different tumor types. Hierarchical clustering was performed using 
with a Pearson correlation metric and average linkage were used for evaluating overall gene 
expression for the sixty-one BRCA /-linked, 2?flC42-linked and sporadic tumors (see Example 1). 
When applicable, all statistical tests were two-sided. 

25 In Figure 2, B2 represents BRCA2-\wked tumors, and Bl represents BRCAlAmksd tumors. 

The red and green intensities represent standard normal deviation (Z score) values from each 
marker's means expression level (represented as black) across the sixty-one tumors samples. Red 
represents increased expression and green represents decreased expression. The differences in gene 
expression can be appreciated by looking at the groupings apparent in Figure 2A. The genes in the 

30 left half of the Figure 2 A are from £J?G42-linked tumors and the genes in the right half are from 
2?/2G4/-linked tumors. As can be seen with casual observation, gene expression between these two 
tumor groups differs relative to the control (IOSE cells). Specifically, BRCA2-lmked tumors contain 
under-expressing genes that correlate to these genes in the upper left and lower right quadrants of 
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Figure 2A, which are represented as primarily green in color. Furthermore, the genes in the upper 
right and lower left quadrant, which are represented as primarily red in color, correlate to genes that 
are generally over-expressed relative to the control IOSE cells. Hence, hierarchical clustering can he 
used to qualitatively visualize differences in the expression patterns of samples. 

5 

C. Multidimensional Scaling 

Multidimensional scaling is a dimension reduction procedure that can be used for 
visualization purposes. Each experiment can be represented by its expression profile, which is a K- 
dimensional vector of log-ratios, where K is the number of clones represented after filtering. The 

1 0 multidimensional scaling procedure reduces each experiment's expression profile from K- 

dimensionai space to 3-dimensional space, by attempting to preserve distances between the N 
experiment vectors. The distance metric needs to be specified when using the multidimensional 
scaling tool. First, the N xN distance matrix is computed, which quantifies the relationships between 
the N experiments in the series of chips. For each of the N vectors in K-dimensional space, the 

15 multidimensional scaling procedure finds a vector in 3-dimensional space, such that the N x N 

distance matrix computed in 3-dimensional space approximates the N x N distance matrix computed 
in K-dimensional space. The relationships between the N experiments can then be visualized by 
plotting the N vectors in 3-dimensional space, in which each of the N points represents a single 
experiment A rotating 3-dimensional visualization tool can be used for discovery of experiment 

20 clusters. 

By way of example, the gene expression data of 6445 filtered genetic elements of the sixty- 
one ovarian tumor samples (see Example 1) was used in multidimensional scaling to generate a 3-D 
diagram for visualization of the respective differences between the expression patterns of each tumor 
sample. As seen in Figure 1, the data segregate into different areas of the 3-D space based on 

25 similarities in gene expression within the tumor type. In particular, the ^C^7-linked*turriors (dark " 
circles) segregate higher into the cube than the BRCA2-11nked tumors (open circles). The sporadic 
tumor samples (asterisks) also fell into higher and lower areas of the cube, indicating that they 
segregate into2?i?C/4/-type euidBRCA2-typG expression patterns. Thus, multidimensional scaling can 
be used to make a qualitative distinction regarding the expression patterns of these samples. 

30 Multidimensional scaling can be used to qualitatively assess the expression pattern of an 

unknown tumor type. Expression data for a plurality of BRCAl-type and BRCA2-type markers is 
generated using the tumor tissue (for instance, on a cDNA microarray) relative to a standard ovarian 
tissue (e.g., from a subject not having ovarian cancer, immortalized ovarian epithelial cells, etc.), and 
logarithmic ratios of the gene expression data are calculated. To compare the pattern of expression of 

35 the plurality of the known BRCAl-type and BRCA2-type markers to the unknown ovarian sample, the 
K-dimensional vectors of the logarithmic expression ratios for all expression data are calculated as 
discussed above. Next, the K-dimensional vectors are plotted in a 3-dimensional space and the layout 
of the data compared. Similar to Figure 1, the unknown sample data should cluster either near the 
£#G4/-like or BRCA2Aik& tumors, or alone (which would indicate that it is a non-BRCA-like 
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tumor). Hence, multidimensional scaling can be used to make a qualitative distinction regarding the 
expression patterns of an unknown samples in comparison to known BRCA /-type and BRCA2-typQ 
markers. In addition, more than one unknown sample can be used in this analysis. 



5 2). Compound Covariate Predictor Analysis 

Segregation into tumor types can be performed using compound covariate predictor analysis, 
which creates a multivariate predictor for one of two classes to each sample (see Example 4). 
Markers included in the multivariate predictor are those that are univariately significant at the 
selected significance cutoff (e.g., PO.0005). The multivariate predictor is a weighted linear 
10 combination of log-ratios (or log intensities for single-channel arrays) for genes that are univariately 
significant. The weight consists of the univariate t-statistics for comparing the classes, and is 
calculated using the equation: 



CCP-O/t^fa-m,) 

15 

where t^ = t-value for gene i (see Table 10), X/ = logarithmic ratio of the gene expression (i) in the new 
sample to be classified, and m, = midpoint between the two classes for gene i (see Table 10). The 
index / runs over all the genes that are significant in the original analysis (i.e. all 62 genes in Table 
10). If the log ratio x, is missing for gene i in the new sample to be classified, then it should be 

20 assigned as m/ for that gene, to cause the result of the calculation to be zero for that gene. If the 

compound covariate predictor value is positive, then the tumor classified as one of the first type (e.g., 
BRCAl-like). If the compound covariate predictor value is negative, then the tumor is classified as 
belonging to the second type (e.g., BRCA2-like). 

A second method of tumor classification using a compound covariate predictor model can be 

25 found in Radmacher et al, "A paradigm for class prediction using gene expression profiles," found 
on the National Cancer Institute Internet website. This publication is expressly incorporated by 
reference herein. 

In order to determine whether a tumor is classified as BRCAI-like or BRCA2-]ike using a 
single markers in Table 10, the following steps are used: 

30 

1 . Gene expression information is obtained, for instance on a cDNA microarray, using 
the same standard (e.g., IOSE cells) that was used to obtain the marker data. 

2. The gene expression data is converted into a logarithmic ratio using log base 10. 
Hence, a tumor that has a gene expression value for gene KIAA00008 of 0.45 would have a log base 

35 10 ratio of -0.346. 

3. The midpoint value of Table 10 is subtracted from the logarithmic ratio, and 
multiplied by the t-value for KIAA00008 for the data set. Thus, 

[(-0.346) - (-.431)] * (-8.0421) = -0.51. 
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The values for the average logarithmic ratio for BRCA /-linked and BRCA2-\wked values in 
the data set are then consulted. The obtained value will fall between the midpoint and one of these 
values because genes in which larger values of the logarithmic ratio are assigned to one class (e.g., 
BRCAl-lxaked) will have weights of a value that is more negative with respect to the midpoint value 
5 (e.g. 9 -0.56864), whereas genes in which larger values of the logarithmic ratios are assigned to the 
other class (e.g., 2?/*G42-linked) will have weights of a value that is more positive with respect to the 
midpoint value (e.g., -0.29414). Hence, the obtained value, 0.1930, would fall on the more negative 
side of this data, and would therefore be classified as a BRCA2-\\k& data set. 

If this same analysis is performed using multiple markers, the method remains the same 
10 except that the data can be summed prior to performing the analysis. This method is a multivariate 
approach of the compound covariate analysis, and can be used to determine whether the pattern of 
expression of an unknown tumor is similar to a BRCA 1 -like or BRCA2-like pattern of expression. 

Further analysis, such as a "leave-one-out" approach may additionally be employed to test 
the ability of the Compound Covariate Predictor to classify the tumors into additional subtypes, such 
15 as resistance to a therapeutic compound. See Radmacher et al, "A paradigm for class prediction 
using gene expression profiles," found on the National Cancer Institute Internet website, 

E. Comparisons using Databases 

Due to the large amount of information associated with the analysis methods disclosed 
20 herein, it may be particularly useful to construct and/or consult databases of information for use in the 
analysis. 

By way of example, the information generated by the methods of the disclosure can be 
stored in databases, such as a database of a plurality of markers known to express differently in 
BRCAJ-]ike and£i?C42-like tumors (eg., Table 9). Such databases may be made publicly available, 

25 such as the Stanford Microarray Database (maintained by Stanford University, see Sherlock et al, 
Nucleic Acids Res. t 29(1): 152-155, 2001). These databases may be used to store reference data for 
use with the classification methods of the disclosure. In addition, such databases can be used to 
provide information regarding markers of potential use in diagnosing, prognosing, or monitoring 
ovarian cancer, for use by clinicians. 

30 The use of databases to search for stored information is disclosed in U.S. 5,871,697 and 

6,519,583 the methods of which are expressly incorporated herein. 

VI. Kits for measuring the level or function of ovarian cancer-related molecules. 

The nucleic acid sequences and ESTs disclosed herein can be supplied in the form of a kit 
35 for use in detection or monitoring ovarian cancer. In such a kit, one or more of the nucleic acid 
sequences and/or ESTs in Table 1 are provided in one or more containers, or in the form of a 
microarray. The kit may also contain reagents for use in preparing a biological sample of a subject 
for screening with the kit. The containers) in which the reagent(s) and microarray(s) are supplied 
can be any conventional container that is capable of holding the supplied form, for instance, plastic 
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boxes, microfuge tubes, ampoules, or bottles. In some applications, negative controls obtained from 
a subject free from ovarian cancer may be provided inpre-measured (e.g., single use) amounts in 
individual, typically disposable, tubes or equivalent containers. With such an arrangement, the 
sample to be tested for the presence of ovarian cancer can be added to the testing container and tested 
5 directiy. 

The amount/number of each testing reagent and container supplied in the kit can be any 
appropriate amount, depending for instance on the market to which the product is directed. For 
instance, if the kit is adapted for research or clinical use, the amount of each testing reagent and 
container provided would likely be an amount sufficient to screen several biological samples. Those 
10 of ordinary skill in the art know the amount of testing reagent that is appropriate for use in a single 
container. General guidelines may for instance be found in Innis et al (PCR Protocols, A Guide to 
Methods and Applications, Academic Press, Inc., San Diego, CA, 1990), Sambrook et al (In 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989), and Ausubel et 
al (In Current Protocols in Molecular Biology, Greene Publ. Assoc. and Wiley-Intersciences, 1992). 

15 A kit may include more than two nucleic acid sequences or ESTs, in order to facilitate 

screening of a larger number of ovarian cancer markers or tumor suppressors. For instance, the 
sequences set forth in Table 1, or a subset of (e.g., 5, 10, 15, 20, 50, 100, 150, 200, 250, 300, 350, 400 
or more) of these sequences, may be provided. By way of example, a provided subset could include 
the markers set forth in Table 6, Table 7, Table 8, Table 9, or Table 10. These sets of sequences are 

20 provided by way of example only, and are not intended to be limiting examples. 

In some embodiments of the current disclosure, kits may also include the reagents necessary 
to carry out screening reactions, including, for instance, RNA sample preparation reagents, 
appropriate buffers (e.g., polymerase buffer), salts (e.g., magnesium chloride), and secondary 
detection reagents (e.g., cyanine 5-conjugated dUTP). 

25 Kits may in addition include either labeled or unlabeled sequences for use in detection of the " 

expression levels. 

Embodiments of the disclosure are illustrated by the following non-limiting Examples. 

EXAMPLE 1 

30 Identification of Genes with Altered Expression in Ovarian Cancer 

This example describes how a first subset of the disclosed ovarian cancer-related nucleic 
acid molecules were identified. These ovarian cancer-related molecules show differences in 
expression in subjects having ovarian cancer compared to normal ovarian surface epithelial cells and 
35 are classified according to their BRCAA, BRCA-2, and sporadic tumor status. The results of these 
studies have been published in Jazaeri et al, J. Natl Cancer Inst, 94(13): 990-1000, 2002, which is 
incorporated by reference in its entirety herein. 
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Methods and Material: 

Clinicopathologic characteristics ofBRCA-linked and sporadic ovarian cancers: Sixty-one 
cases of pathologicaUy-confimied epithelial ovarian adenocarcinoma from the Memorial Sloan- 
Kettering Cancer Center were studied and screened for founder mutations. These included eighteen 
5 cases linked to BRCA1, sixteen cases linked to BRCA2, and twenty-seven sporadic cases. All patients 
were self-identified as Ashkenazi Jews and after informed consent underwent genotyping for germ- 
line founder mutations hxBRCAl (185delAG and 5382insC) and BRCA2 (6174delT) (see Boyd et ai, 
JAMA: 283: 2260-2265, 2000). Those cases with a BRCA mutation were categorized as having 
hereditary ovarian cancer and those without such a mutation as having sporadic ovarian cancer. 

10 Tumor samples: In order to minimize confounding variables, BR CA 1 -linked, BRCA2- 

linked, and sporadic tumors of similar stage, grade, and histology were selected from the sixty-one 
individuals studied [18 BRCA1 (185deIAG, 5382insC), 16 BRCA2 (6174delT), 27 sporadic tumors). 
The majority of tumors in all three groups were characterized by advanced stage, moderate to high 
grade (grade 2 or 3), and a predominance of serous histology. Hence, the clinicopathologic 

15 parameters of selected samples were well-matched and in agreement with those reported previously 
for these tumors types (see Boyd et al, JAMA: 283: 2260-2265, 2000: Ramus et ah, Genes Chrom. 
Cancer: 25:91-96,1999). 

All tumor samples had been flash frozen, embedded in OCT medium, and stored at -80° C. 
Isolation of RNA was performed using the RNeasy columns (Qiagen, Valencia, CA) according to the 
20 manufacturer's instructions. The integrity of RNA was verified by denaturing gel electrophoresis. 
Total RNA was linearly amplified using a modification of the Eberwine method (see Van Gelder et 
al, Proc. Natl Acad. Set U.SA. 87: 1663-1667, 1990). Table 2 catalogs the clinicopathologic 
features of the tumor samples studied. 
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Table 2. Clinicopathologic features of tumor samples 


Variable 


JH*C4i-linked 


J?*G42-Hnked 


Sporadic 


Number of samples 


18 


16 


27 


Median Age* (SD) 


50(11) 


60(9) 


69(11) 


Stage 




I 


2(11.1%) 


0 


0 


H 


0 


2(12.5%) 


3(11.1%) 


HI 


11(61.1%) 


12 (75%) 


24 (88.9%) 


IV 


5 (27.8 %) 


2 (12.5%) 


0 


Grade 




1 


0 


0 


0 


2 


4 (22.2%) 


6(37.5%) 


8 (29.6%) 


3 


14 (77.8%) 


7(43.8%) 


16(59.3%) 


No. 


0 


3 (18.7%) 


3(11.1%) 


Histology** 




Serous 


9 (50%) 


12 (75%) 


, 16 (59.3%) 


Endometrioid 


3 (16.7%) 


0 


2(7.4%). 


Mucinous 


0 


0 


0 


Clear Cell 


2(11.1%) 


0 


0 


Adenocarcinoma NOS 


3 (16.7%) 


3 (18.8%) 


9 (33.3%) 


other 


1 (5.5%) 


1 (6.2%) 


0 


* F test, P - .0002/ Data are the median +/- standard deviation. 

** Chi test P value for differences in histology among tumor groups= 0.17 

NOS = not otherwise specified 



cDNA Microarrays: The cDNA microarrays consisted of 7,651 total features representing 
different (non-redundant) genes, and were manufactured at the National Cancer Institute microarray 
facility. 

5 Total RNA was reverse-transcribed by using a 63 nucleotide synthetic primer containing the 

T7 RNA polymerase binding site (5 

GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGGma^^SEQIDNO: 1). Second 
strand cDNA synthesis (producing double-stranded cDNA) was performed with RNase H, 
Escherichia coli DNA polymerase I, and E. coli DNA ligase (Invitrogen, Carlsbad, CA). After 

10 cDNA was blunt-ended with T4 DNA polymerase (Invitrogen, Carlsbad, CA), it was purified by 
extraction with a mixture of phenol, chloroform, and isoamyl alcohol and by precipitation in the 
presence of ammonium acetate and ethanol. The double-stranded cDNA was then transcribed using 
T7 polymerase (T7 Megascript Kit, Ambion, Austin, TX), yielding amplified antisense RNA that was 
purified using RNeasy mini-columns (Qiagen, Valencia, CA). Pooled total RNA from two SV40 

1 5 immortalized ovarian surface epithelial cell-lines (IOSE) was amplified and used as reference for 
cDNA microarray analysis. 

Four |xg of amplified RNA was reverse transcribed and directly labeled using cyanine 5- 
conjugated dUTP (tumor RNA) or cyanine 3-conjugated dUTP (IOSE RNA, provided by Dr. Jeff 
Boyd, Memorial Sloan-Kettering). Hybridization was performed in a solution of 5X SSC and 25% 
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forrnarnide for 14-16 hours at 42°C. Slides were washed, dried, and scanned using an Axon 4000a 
laser scanner (Axon Instruments, Inc., Union City, CA). 

Imaging and LM.A.G.E. Analysis: Fluorescence intensities at the immobilized targets were 
measured by using an Axon GenePix Scanner and Genepix Pro 3.0 analysis software (Axon 
5 Instruments, Union City, CA). The raw data were then uploaded to a relational database maintained 
by the Center for Information Technology at the National Institutes of Heath. The cDNA clones are 
identified by their Integrated Molecular Analysis of Genomes and their Expression Consortium 
(LM.A.G.E.) clone number. 

Amplification ofRNA: The first strand of RNA was synthesized, by adding 1-3 ug of total 
10 RNA into a reaction tube (e.g., Eppendorf, or other container of suitable size), adding 1 ul T7-(dT) 24 
primer (2^g/ul), and bringing to a volume of 20 ul with nuclease-free water. The reaction was 
incubated at 70°C for 10 minutes, then spun briefly in a centrifuge and placed on ice. Four ul 5X first 
strand cDNA buffer was added, then 2 ul 0.1M DTT, 2ul lOmM dNTP mix (Amersham-Pharmacia, 
Piscataway, NJ 08855-1327 USA), lul RNasin (Promega, Madison, WI 53711 USA), and 2 ul 
15 Superscript II. The reaction was mixed well and incubated at 42°C for 1 hour. The tube was 

centrifuged briefly, and placed on ice. To synthesize the second strand, 91 ul DEPC-treated water 
was added, then 30 ul second strand buffer, 3 ullO mM dNTP mix, 4 ul DNA Polymerase I (10U/ 
ul), 1 ul DNA Ligase (lOU/ul), and 1 ul RNAse H (2U/ul). The final reaction volume equaled 150 
Hi. Next, the tube was gently tapped to mix, then briefly centrifuged. The tube was incubated at 
20 16°C for two hours, then 2 ul (10U) T4 DNA Polymerase was added. The tube was cooled for five 
minutes at 16°C, then the reaction stopped with 10 ul of 0.5 M EDTA. Ten ul of 1M NaOH were 
added, then the reaction was incubated at 65°Cfor 10 minutes. The solution was neutralized by 
addition of 25 ul Tris-HCl (pH=7.5). 

Clean Up of Double Stranded cDNA: The Phase Lock Gel (PLG) was pelleted in a 
25 microcentrifuge at maximum speed for 30 seconds. 198 ul (equal volume) of (25:24:1) Phenol : 

chloroform : isoamyl alcohol (saturated with 10 mM Tris-HCl pH 8.0/1 mM EDTA) was added to the 
final DNA synthesis preparation (198ul) to a final volume of 396 ul. The solution was mixed well by 
pipetting up & down vigorously. The entire cDNA-phenol/chloroform mixture was transferred to the 
PLG tube, and microcentrifuged at maximum speed for two minutes. The aqueous supernatant was 
30 transferred to a new 1.5 ml tube, and 1 ul linear acrylamide was added. 0.5 volumes of 7.5M 

Ammonium Acetate + 2.5 volumes (include the added Ammonium Acetate) of 95% ethanol stored at 
-20 to the sample was added and the solution was vortexed, then centrifuged at maximum speed in a 
microcentrifuge at room temperature for 20 minutes. The supernatant was removed and the pellet 
was washed with 0.5 ml of 80% ethanol. The solution was centrifuged at maximum speed for 5 
35 minutes at room temperature. The 80% ethanol was poured off; and the 80% ethanol wash repeated. 
The pellet was air dried for approximately 15 minutes, then resuspended in 16 ul of nuclease-free 
water. 

In Vitro Transcription: Using an Ambion T7 Megascript kit (Ambion, Austin, TX 78744- 
1832, USA), the manufacturer's instructions were followed to create a. 40 ul reaction (£.*, the 20 ul 
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standard reaction was doubled and incubated at 37°C for 4-5 hours). The reaction was assembled at 
room temperature using 16ul of template double-stranded DNA, to avoid the precipitation of 
spermidine, which can occur if done on ice. Four ul of lOx reaction buffer were added, then 4 ul of 
ATP solution (75mM 17), 4 ul of CTP solution (75mM T7), 4 ul of GIT solution (75mM T7), 4 ul 
5 of UTP solution (75mM T7), 4 ul of Enzyme Mix, and the reaction was incubated at 37°C for 5-6 
hours. Sixty ul nuclease-free water was added to bring the total volume up to 100 ul. The RNA was 
"cleaned" using the "RNA clean-up" protocol provided in the Qiagen RNeasy Mini Handbook, May 
1999, pp. 48-49, Qiagen, Valencia CA. RNA was eluted with 30 ul of nuclease-free water, and the 
optical density ratio was measured (the sample should have an optical density of greater than 1.8 
10 when measured at 260/280 nanometers). The expected yield from this preparation was ten times the 
starting amount of total RNA, and the RNA was then ready for use in generating probe for 
microarrays using total RNA (see below). 

Second Round Amplifications: 0.5-1.0 ug of amplified RNA were resuspended in 1 1 ul 
ultrapure water. 

15 First Strand Synthesis : One ul Random hexamer (1 rag/ml) was added and the 

reaction was incubated at 70°C for 10 minutes, then chilled on ice, then allowed to equilibrate at 
room temperature for 10 minutes. Four ul 5X First strand cDNA buffer, 2 ul 0. 1M DTT, 2 ul lOmM 
dNTP mix, 1 ul RNasin were added, and the reaction was mixed incubated at 42°C for 2 rninutes. 
Two ul Superscript II were added, and the reaction was mixed well and incubated at 42°C for 1 hour. 

20 One ul RNAse H was added, and the reaction was incubated at 37°C for 20 minutes, then heated to 
95°C for 2 rninutes to quell the reaction, then chilled on ice. 

Second Strand Synthesis : One ul T7-oligodT primer (0.5 mg/ml) was added, and 
the reaction was incubated at 70°C for 5 rninutes and at 42°C for 10 minutes. Then, 91 ul DEPC 
treated H 2 0 were added, then 30 ul Second strand buffer, 3 ullO mM dNTP mix, 4 ul DNA 

25 Polymerase I (10U/ ul), 1 ulDNA Ligase (lOU/ul), and 1 ul RNAse H (2U/ul) to a final volume of 
150 ul. The tube was tapped gently to mix, then briefly centrifuged. The reaction was incubated at 
16°C for two hours, then 2 ul (10U) T4 DNA Polymerase were added, and the reaction was cooled 
for 5 rninutes at 16°C. The reaction was stopped with 10 ul of 0.5 M EDTA, then 10 ul of 1M NaOH 
were added. The reaction was incubated at 65°C for 10 minutes, then neutralized by addition of a 

30 solution with 25 ul Tris-HCl (pH=7.5). The protocol for "Clean Up of Double Stranded cDNA" and 
*Tn Vitro Transcription" was followed to generate cDNA for use in preparation of the probe for 
microarray hybridization. 

Preparation of Probe and Microarray Hybridization Using Amplified RNA: To prepare the 
probe, reverse transcription labeling reaction mixes were created for each probe containing the 

35 component Random Primer (InVitrogen, Carlsbad, California 92008, USA). Three ug/ul in 2ul were 
added, then 5-6 fig amplified RNA. The reaction was brought to a final volume of 17 ul with water, 
then incubated at room temp for 1 0 minutes. To each probed, 5X first strand buffer in 8 ul, 20X 
lowT-dNTP mix in 2 ul, 0.1 M DTT in 4 ul, RNAsin in 1 ul, Cy-3 or Cy-5 dUTP (NEN Life Science, 
Boston, MA 021 18-2512 USA) in 4 ul, and Superscript!! (GIBCO-BRL, InVitrogen Corporation, 
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Carlsbad, California 92008 USA) enzyme in 2 ul was added The reaction was incubated at 42°C for 
60 minutes, then 5 ul of 500 mM EDTA and 10 ul of 1M NaOH were added. The reaction was 
incubated at 65°C for 15 minutes to hydrolyze residual RNA, then cooled to room temperature. 
Twenty-five 25 ul of 1 M Tris-HCl (pH7.5) was added to neutralize pH. 
5 Probe Cleanup: 500 ul of IX TE were added to a Microcon-YM30 column and the column 

was spun at 13,000 rpm for 5-6 minutes to wash the column. Membrane integrity was checked by 
looking into the top insert, to confirm that a thin film of TE (-50 ul) covered the membrane. 400ul 
IX TE was added to each of the sample tubes and all contents were transferred to the washed 
Microcon-YM30 column (Amicon, Millipore Corp., Bedford, Massachusetts). The column was spun 
10 at 13,000 rpm for 5-6 minutes until approximately 50 ul was left on the membrane. The column was 
checked for dye crystals along the edge of the column membrane, which indicated that the probe was 
likely to be good. 450 ul IX TE were added to the column and the column was spun down to -50 ul 
as above. The presence of crystals was confirmed. The Cy-3 labeled probe was placed into a clean 
tube, and the column was spun at 14,000 rpm for 1 minute to elute the probe. The Cy-3 labeled probe 
15 was added to the Cy-5 labeled probe in the column, and approximately 450 ul IX TE was added to 
the column. The column was spun at 13,000 rpm until approximately 13-14 ul of combined probe 
remained on the membrane, which was checked with a pipette. The combined probe was inverted 
into a clean tube, and spun at 14,000 for 1 minute to elute. The probe (14 ul) was transferred into a 
clean Eppendorf tube and stored at 4 °C until used in the hybridization reaction. 
20 Probe Hybridization: Twenty ul of water were added to each humidifying well in the 

Hybridization Chamber (to maintain humidity). Then, 40 ul of prehybridization buffer (5X SSC, 
0.1% SDS, 1% BSA (Sigma) warmed to 42°C) were placed in the center of the slide and the cover 
slip was placed on the slide, taking care to prevent bubble accumulation beneath the slip. The margin 
clamps on the Hybridization Chamber were firmly attached, and the chamber was incubated at 42 °C 
25 for least 1 hour. The slide was washed in distilled water for 2 minutes, followed by isopropanol for 2 
rninutes. The slide was dried in a centrifuge (5804R, Eppendorf) at 705 rpm (~70x g) for 4 minutes, 
then prepared for hybridization as discussed above. The slide was hybridized within 1 hour of the 
prehybridization step. Two 2 ul COT1-DNA (Hoffman La Roche, Nutley, New Jersey 071 10 USA) 
(1 ug/ul), 2 ul polyA (Sigma) (8-10 jig/ul), and 2 ul yeast tRNA (Sigma, Ronkonkoma, NY 1 1779 
30 USA) (4 ug/ul) were mixed with the probe. Then, the probe was denatured for 1 minute at 1 00°C, 
placed briefly on ice to cool the reaction, and spun down in a centrifuge. Twenty ul of 2X 
hybridization buffer (50% formamide, 10X SSC, 0.2% SDS, warmed to 42°C) were added to the 
denatured probe, mixed well (taking care to mmimize bubble formation) and kept at 42°C until ready 
to spot on the slide. The hybridization chamber was prepared as in the prehybridization step with 
35 20ul of distilled water in each well. The slides were placed face-up in the chambers, and the probe 
was hybridized with the slide for 14-16 hrs at 42'C. 

Slide Washes: The margin clamps on the Hybridization Chamber were carefully removed to 
prevent water from seeping in and contaminating the array. The slide was removed from the 
chamber, held with forceps and the cover slip allowed to fall off into the solution containing 2X SSC, 



WO 03/068054 



PCT/IIS03/04688 



10 



37 

0.1% SDS. The slide was washed for 4 minutes in IX SSC, 0. 1% SDS, for 4 minutes in 0.2X SSC, 
and for 1 minute in 0.05X SSC. The slide was spun dry in a centrifuge at 705 rpm (approximately 
70x. g) for 4 minutes. If water droplets were seen on the slide, it was spin again for another 4 
minutes. Exposure to light was minimized by placing the dried slides in a slide box until ready for 
scanning. 

Statistical Analysis-. The logarithmic expression ratios for the spots on each array were 
normalized by subtracting the median logarithmic ratio for the same array. Data were filtered to 
exclude spots with size less than 25 urn, intensity less than 2 times background or less than 300 units 
in both red and green channels, and any poor-quality or missing spots. In addition, any features 
found to be missing or flagged in greater than 10% of the arrays were not included in the analysis. 
Application of these filters resulted in the inclusion of 6,445 of the total 7,651 features in subsequent 
analyses. Statistical comparison between tumors groups was performed using the "BRB Array 
Tools" software (developed by Dr. Richard Simon and Amy Peng, Biometrics Research Branch, 
Division of Cancer Treatment and Diagnosis, NCI, USA). A modified F-test is run on each gene's 
15 log-ratio values, and the significance of that gene is determined with P < 0.0001 considered 
significant. This stringent P value is selected in lieu of the Bonferroni correction for multiple 
comparisons, which was deemed excessively restrictive (Figure 1) (see Bland and Altman, B.M.J.; 
310: 170, 1995). 

Semi-quantitative PCR: Five samples from each tumor group (BRCA-l, BRCA-2, sporadic) 
were selected at random. For each sample, 3.5 ug of total RNA was reverse-transcribed using oligo 
dT primers and 400 units of Superscript II reverse transcriptase (Invitrogen, Carlsbad, CA) in the 
presence of all four deoxyribonucleoside S'-triphosphates (each at 10 mM) (InVitrogen, Carlsbad, 
CA) and 40 units of RNAse inhibitor (Promega, Madison, WI). Reverse transcription was performed 
in a total reaction volume of 40 ul, of which 1 pi was subsequently used for each PCR reaction. 
25 Preliminary experiments were perfoimed to identify optimal cycle number for each gene. Thirty 
cycles was found to be optimal amplification for all amplified RNAs except for HLA-DRB1 and 
CD74, which were amplified for 26 cycles. Polymerase chain reaction was performed using the 
GeneAmp PCR kit (PE Applied Biosystems, Foster City, CA) according to the manufacturer's 
instructions. Representative gene specific primer sequences are shown in Table 3: 



20 
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Table 3. Primer pairs for RNA amplification. 



Gene specific primer pair 


SEQ ID NO. 


ACTB 


5 ^GGTGGCTTTTAGGATGGCAA-3 ' 


SEQ ID NO: 2 
SEQ ID NO: 3 




5 ^TTCGGCTTCACCCTAGTCTCA-3 * ' 
5'-AGAGGGAATACAGAGTCCCGAA-3' 


SEQ ID NO: 4 
SEQ ID NO: 5 


ZFP36 


5 1 -ACCCTGATGAATATGCCAGCA-3 1 
5 , -GCTACTTGCTTTTGGAGGGTA-3 ' 


SEQ ID NO: 6 
SEQ ID NO: 7 


RGS1 


5*-GACTCnATCCCAGGTTCCTCA-3 1 
5 '-TGACTCCCTGGTTTTAAGAGC A-3 1 


SEQ ID NO: 8 
SEQ ID NO: 9 


CD74 


5 '-CCAGTCCCC ATGTGAGAGCA-3 ' 
5 , -AGCTGATAACAAGCTrGGCTGA-3' 


SEQ ID NO: 10 
SEQ ID NO: 11 


TOP2A 


5 '-TGTCCCTCCACGAGAAACAGA-3 ' 
5 '-CGTACAGATTTTGCCCGAGGA-3 ' 


SEQ ED NO: 12 
SEQ ID NO: 13 


HLA-DRBJ 


5 '-GCGAGTTGAGCCTAAGGTGA-S' 
5 '-TTGAAGATGAGGCG CTGTCA-3 » 


SEQ ID NO: 14 
SEQ ID NO: 15 



10 



Amplified RT-PCR products were visualized on an agarose gel stained with ethidium bromide. The 
intensity of each band was an indicator of the quantity of DNA, as previously amplified by PCR. 
Thus, the intensity served as an indirect measure of the starting amount of the KNA amplified from 
the respective gene in each sample. Intensity was quantified using an ultraviolet light source and 
Alpha Imager software (Alpha Innotech Corp, San Leandro, CA). In addition to the above- 
mentioned tumor samples, sqRT-PCR evaluation of selected genes was also performed on the IOSE 
RNA for comparison. 



Results 

Global assessment ofzene expression dif ferences amone tumor grgugsi Prior to 
investigating specific inter-group differences, the overall patterns of gene expression in the three 
tumor types (BRCA-1, BRCA-2, sporadic) were assessed. Multidimensional scaling (MDS), based on 

15 the expression levels of all 6,445 filtered genetic elements in (he microarray, revealed that BRCA1- 
and BflG42-linked tumors have distinct molecular profiles. In contrast, the sporadic samples showed 
a more heterogeneous distribution pattern, with many patterns clustering near the patterns of BRCA 1- 
linked or BRCA21tiked samples (Figure 1A). The MDS results suggested that the BRCA J- and 
£tf G42-associated groups would be the most different and that gene expression patterns for each of 

20 the BRCA groups and the sporadic tumors would have fewer differences. In support of this 

hypothesis, only a few genes showed statistically significant (PO.0001) differential expression 
between the sporadic tumors and the BRCA J- oxBRCA2Amkcd tumors, whereas 1 10 genes were 
differentially expressed between BRCAUiriaA and 5* G42-linked tumors (Figure IB). In addition 
34 EST sequences were differentially expressed between BRCA J- and BRCA2-Unkcd tumors. The 

25 group of 144 total markers that were differentially expressed between BRCA1- and BRCA2-type 
tumor cells compared to normal ovarian epithelial cells is listed in Table 9 (see Addendum). 

Differential gene expression among all three groups was also performed, which identified 60 
genes and 3 EST sequences whose expression segregated BRCA /-linked, 5tfC«-linked, and 
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sporadic tumors (modified F test, wherein P<0.0001). Fifty-one of these 63 genes and EST 
sequences were also among the statistically significant discriminators of BRCAl and BRCA2 tumors, 
highlighting the distinct gene expression profiles of these two groups. In addition, the expression 
profile of the combined BRCA 1- and£/*G42-linked group was remarkably similar to that of the 
5 sporadic tumors, as demonstrated by only three genes showing differential expression (PO.000 1 ) 
between these groups [PSTPIP1 (SEQ ID NO: 538-540), IDH2 (SEQ ID NO: 541-542), and PCTKJ 
(SEQ ID NO: 527-528)]. These observations were in agreement with the multidimensional scaling 
analysis and demonstrated that, in terms of the overall pattern of gene expression, the BRCAl- and 
BRCA2-livked tumors are distinct from one another. Furthermore, the gene expression profiles of the 
10 sporadic tumors appear to share features of either BRCA J- or BRCA2-lihked cancers, and these 
sporadic tumors are referred to herein as BRCAl-typc or BRCA2-typz sporadic ovarian tumors. 

The group of 144 nucleic acid molecules listed in Tahle 9 was further investigated using 
hierarchical clustering (Figure 2A, B). As expected, the ARG4-associated tumors showed distinct 
and contrasting expression profiles (Figure 2A). Strikingly, the sporadic samples also segregated into 
15 two groups based on the expression patterns of the same 144 genes, exhibiting sporadic sample had a 
molecular profile similar to mat of either the BRCAl- or the &RG42-linked tumors. This observation 
was illustrated by hierarchical clustering of all samples, revealing distinct "BRCAl -type" and 
"BRCA2-type" clusters (Figure 2A). This clustering further demonstrates that sporadic tumors 
(which do not contain the BRCA1 or BRCA2 mutations) can often be classified as BRCAl-type or 
20 BRCA2-typt. Classification of sporadic tumors into these subtypes may provide guidance in treating 
the patient For example, a subject who has a BRCA J -typo or BRCA2-type sporadic tumor may be 
treated similarly to a subject who has a BRCA J -linked or BRCA2-hhkod tumor. The identification of 
BRCAl- ovBRCA2-type sporadic tumors also allows tumors (or subjects) to be selected for specific 
drug regimens that are particularly effective with the associated mutation type. 
25 Color-coding is usually used to represent the relative transcript expression ratio, as measured 

by cDNA microarray analysis. Red customarily indicates the maximum point in gene expression, 
green the minimum, and levels closer to the mean approach black 

To ensure that the 2?2* G4-linked samples were not biasing the observed clustering patterns, 
the hierarchical architecture of gene expression in sporadic tumors was examined separately. Even in 
30 the absence of the BRCAAinked samples, two distinct cluster phenotypes were observed, each 
comprised of those sporadic samples that previously grouped with BRCAl - and ARG42-linked 
tumors (Figure 2B). Tumor histology and patient age were also evaluated for possible confounding 
effects on the observed BRCA J -type and BRCA2-type clusters. Neither variable was found to 
influence clustering patterns (Figure 2A, 2B). 
35 Genes differentially expressed between BRCAl- and BRCA2-Iinked ovarian carcinomas: 

The analysis of overall gene expression patterns established that the same genes whose expression 
differentiated BRCAl and BRCA2Awked tumors, also identified two major sub-populations of 
sporadic cancers (Figure 3). As such, these nucleic acids are believed to represent important 
mediators of common genetic pathways in ovarian cancer and/or carcinogenesis. Many of these 
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genes are involved in important cellular functions including signal transduction, RNA processing and 
translation, chemokine signaling and immune modification, and DNA repair. By way of example, the 
BRCAJ -associated tumors were characterized by higher AKT1 (SEQ ID NO: 504-506) and lower 
PTEN (SEQ ID NO: 507-509) relative expression. In addition UBLI (SEQ ID NO: 510-512) (also 
5 known as SUMO-l and sentrin) was more highly expressed in BRCA1- associated tumors. This 
molecule interacts with RAD51 and RAD52 and has been proposed to have a regulatory role in 
homologous recombination (see Li et al, Nuc. Ac. Res: 28: 1 145-1 153, 2000). The preferential 
expression of UBL1 (SEQ ID NO: 510-512) in the £*G47-linked samples may prove to be relevant 
to possible differences in DNA repair actions of the BRCA tumor suppressor genes. 
1 0 By way of example, the BRCA2-linked tumors showed higher relative expression of WNT2 

(SEQ ID NO: 513-514 and SFRP4 (SEQ ID NO: 515-517), which are members of the wnt-p-catenin- 
TCF signaling pathway. Another notable observation is that both BRCA1- and BRCA2-linktd tumors 
showed preferential expression of proto-oncogenes commonly altered in hematologic malignancies. 
BRCA1 tumors showed higher expression levels of RUNXl(SEQ ID NO; 518-520)/AMLl, while 
15 AKG42-associated samples showed preferential expression of TALI (SEQ ID NO; 521-523)/SCL. 
Both of these oncogenes are transcription factors involved in proliferation, and their preferential 
expression in BRCAI- and 2?/?G42-linked tumors may indicate that the activation of such a 
proliferation driver" is a necessary step in ovarian carcinogenesis. 

Gene expression differences between BRCA-linked and sporadic tumors: Nine non- 
20 redundant genes showed significant differential expression between BRCAl-lwked and sporadic 
tumors [CD72 (SEQ ID NO: 805), SLC25AU (SEQ ID NO: 544), LCN2 (SEQ ID NO: 545-547), 
PSTPIPI (SEQ ID NO: 538-540), SIAHBP1 (SEQ ID NO: 543), UBE1 (SEQ ID NO: 533), WAS 
(SEQ ID NO: 524-526), IDH2 (SEQ ID NO: 541-542), PCTK1 (SEQ ID NO: 527-528), PO.0001, 
Figure 4A . A noteworthy observation was that three of these genes, WAS (SEQ ID NO: 524-526), 
25 PCTK1 (SEQ ID NO: 527-528), and UBE1 (SEQ ID NO: 533), have all been mapped to the Xpl 1.23 
and all were higher expressed in the BRCA /-linked tumors. This observation seemed unlikely to be 
explained by chance alone as only 35 of the total 6,445 filtered spots (0.5%) on the microarray 
represent genes mapped to Xpl 1. To further investigate this pattern, a larger group of 53 genes was 
considered for differential expression between MC4/-linkedand sporadic tumors under the less 
30 stringent significance level of PO.001. Among this group three additional genes, SMC1L1 (SEQ ID 
NO: 530), ARAF1 (SEQ ID NO: 531-532), and EBP (SEQ ID NO: 529), were discovered that also 
mapped to the Xpl 1 .23 locus and also showed higher mean expression in BRCAI -associated samples 
(FIG 4D). Thus, six of fifty-three genes differentially expressed between Zy07-linked and sporadic 
samples (P<.001) mapped to Xpl 1 .23 and all showed higher mean expression in iy?G4i-linked 
35 tumors. In silico analysis of the location of these genes revealed that they are all confined to a 5-Mb 
region of DNA in Xpl 1.23 (Ensemble database, Prous Science, Philadelphia, PA 19102, U.S.A.). 

The comparison between BR G42~linked and sporadic tumors revealed only two genes with 
differential expression among these groups at the significance level of P<0.0001 (Figure 3). The 
gene designated as LOC51760 (SEQ ID NO: 534-535) is also known as B/K (encoding the 
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brain/kidney protein) and is moderately homologous to the synaptotagmin family of vesicular 
transport molecules. The second differentially expressed gene encodes low-density lipoprotein- 
related protein-associated protein 1 (LRPAP1), also known as alpha-2-macroglobulin receptor- 
associated protein 1. 

5 A further comparison consisted of investigating gene expression differences between the 

combined BRCA-lmked group and the sporadic group, which revealed only three non-redundant, 
differentially expressed genes [PSTPIP1 (SEQ ID NO: 538-540), IDH2 (SEQ ID NO: 541-542), and 
PCTK1 (SEQ ID NO: 527-528), Figure 4C. All three genes were among the group of genes that 
differentiated BRGA1 -linked and sporadic samples. This finding is consistent with the observation 

1 0 that the KNA profiles of sporadic ovarian cancers share significant similarities with those of BRCA1- 
linked or BRCA2-lmked tumors. It is believed that the similarities shown in the RNA profiles is a 
general characteristic that applies to gene and protein component profiles as well. The small number 
of differentially expressed genes obtained from the comparison of the combined BRCA group to the 
sporadic tumors is the result of the latter also consisting of BRCAl-typc and£#C42-type molecular 

1 5 classes. 

Gene expressio n features distinguishing ovarian cancers from ovarian surface epithelial 
cells: Gene expression patterns common among all tumor types were investigated to identify genes 
that may be associated with the transformed state, i.e., genes commonly expressed in ovarian tumors 
irrespective of their hereditary or sporadic nature. Gene expression in all sixty-one primary tumor 
20 samples was compared to immortalized ovarian surface epithelial (IOSE) cells used as the common 
reference. Using the selection criterion of two-fold or greater expression ratio relative to the IOSE 
reference in at least two-thirds of all tumors, a list of 201 non-redundant genes and ESTs was 
generated. The top twenty-five overexpressed (IL8 (SEQ ID NO: 449-451), GROl (SEQ ID NO: 
452-453), ALDH1A3 (SEQ ID NO: 454-456), MMPI (SEQ ID NO: 457-459), OSF-2 (SEQ ID NO: 

25 460-461), CDC25B (SEQ ID NO: 462-464), FLNA (SEQ ID NO: 465-467), TFP12 (SEQ ID NO: 
468-469), FGF2 (SEQ ID NO: 470-472) , CD44 (SEQ ID NO: 473-475), DYT1 (SEQ ID NO: 476- 
477), UCHL1 (SEQ ID NO: 478), FGF2 (SEQ ID NO: 470-472), PLA U (SEQ ID NO: 479-480), 
LDHA (SEQ ID NO: 256), PTGS2 (SEQ ID NO: 481-483), PRNP (SEQ ID NO: 484-486), MT1X 
(SEQ ID NO: 487-488), UGB (SEQ ID NO: 489-490), PBEF (SEQID NO: 491-493), TXNRD1 (SEQ . 

30 ID NO: 494-496), NT5 (SEQ ID NO: 497-499), PTGS2 (SEQ ID NO: 481-483), MT2A (SEQ ID NO: 
500-502), ZNF220 (SEQ ID NO: 503)) and twenty-five down-regulated (FLJ221 74 (SEQ ID NO: 30- 
31). DDR1 (SEQ ID NO: 74-76), SERPINF2 (SEQ ID NO: 18-19), HLA-DRB1 (SEQ ID NO: 87-88), 
IFITM2 (SEQIDNOS: 55-57, 58-59), HGF (SEQID NO: 174-175), SORL1 (SEQ ID NO: 149-151), 
CP (SEQ ID NO: 83-84), HLA-DRA (SEQ ID NO: 94-96), BRF2 (SEQ ID NO: 190-192), ABCB1 

35 (SEQ ID NO: 164-166), G1P3 (SEQ ID NO: 68-69), RGS1 (SEQ ID NO: 122-123), IFITM1 (SEQ ID 
NOS: 50-51, 52-54), FOS (SEQID NO: 133-135), PPP1R7 (179-180), HLA-DPA (SEQID NO: 97- 
99), HLA-DRB5 (SEQ ID NO: 85-86), TLR3 (SEQ ID NO: 199-201), ZFP36 (SEQ ID NOS: 167-168, 
169-171, 172-173), SGK (SEQ ID NO: 176-178), HLA-DRB1 (SEQ ID NO: 87-88), HE4 (SEQID 
NO: 60), CD74 (SEQ ID NO: 89-91, CD24 (SEQ ID NO: 181-182)) named genes (by order of 
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magnitude) are presented in Figure 4A and 4B. This analysis revealed two potentially significant 
functional groups of genes to be overexpressed in ovarian cancers. The first group consisted of 
several of the genes that have all been previously shown to be interferon-inducible (HLA-DRB 1 (SEQ 
ID NO: 87-88), HLA-DRB5 (SEQ ID NO: 85-86), HLA-DRA (SEQ ID NO: 373-374), HLA-DPA 
5 (SEQ ID NO: 97-99), CD74 (SEQ ID NO: 89-91), IFITM1 (SEQ ID NOS: 50-51, 52-54), and 
IFITM2 (SEQ ID NOS: 55-57, 58-59), as indicated by italics in Figure 4A and 4B). The second 
group consisted of immediate-early response genes (BRF2, ZFP36, SGK, and FOS). In addition, 
several genes previously reported to be overexpressed in ovarian epithelial tumors were present in the 
list of genes overexpressed in tumors relative to the IOSE cells (Figure 4A and 4B). Elevated levels 
1 0 of CLU, CD24, and MUC1 were also observed. These results identify additional potential markers of 
ovarian cancer. Table 9 lists the 144 nucleic acids that showed significantly elevated expression in 
ovarian cancer. These genes were selected based on consistency across all the pooled experiments 
and a significant difference in the average expression in the 40 independent samples, using a criteria 
of a tumor-to-ovarian surface epithelial cell line ratio of two or greater in at least 66% of all tumors. 

15 

EXAMPLE 2 

Semiquantitative RT-PCR confirms and complements cDNA microarray data 

This example describes how the results found in the previous example were confirmed using 
20 semiquantitative RT-PCR. 

To validate the array data, semiquantitative RT-PCR (sqRT-PCR) analysis of several 
mRNAs was performed in a representative subset of tumors consisting of five ^i?G4i-linked, five 
BRCA2-liriked, and five sporadic RNA samples. The tumor samples were randomly selected. The 
25 * expression of TOP2A (SEQ ID NO: 448), RGS1 (SEQ ID NO: 398, CD74 (SEQ ID NOS: 89-91, 92- 
93), HE4 (SEQ ID NO: 60), HLA-DRB1 (SEQ ID NO: 87-88), and ZFP36 (SEQ ID NO: 167-168, 
169-171, 172-173) were evaluated using sqRT-PCR, with P-actin as a normalizing control. Because 
data obtained from cDNA microarrays is in the form of relative expression ratios between tumors and 
the reference, RNA from IOSE cells and a histologically normal, postmenopausal ovarian RNA 
30 sample in the sqRT-PCR experiments was included for comparison. The results of these sqRT-PCR 
experiments were consistent with the cDNA microarray relative expression data for all six genes 
evaluated (Figure 5A and 5B). As anticipated from the microarray results (Figure 4), HE4 expression 
was consistently elevated in all fifteen tumor samples compared to IOSE reference cell-line and 
normal ovary (Figure 5A and 5B). Invariant chain genes, also known as CD74 and RGS1, were 
35 overexpressed in the majority of tumors as indicated by microarray analysis (Figure 4). Both were 
also found to have increased expression in the majority of tumors as evaluated by sqRT-PCR (Figure 
5A and 5B). The expression of TOP2A was found to be highest in the reference IOSE RNA. 
Furthermore, compared with the expression level in the normal postmenopausal ovary, twelve of the 
fifteen tested tumor samples showed elevated and variable TOP2A gene expression. 
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Several members of the immediate-early response cascade showed elevated expression in 
tumors as compared to IOSE cells in the microarray experiments (as indicated by the notation * in 
Figure 4A and 4B); however, some of these genes have previously been shown to have lower 
expression in ovarian cancer compared to normal ovary (see Welsh etal. 3 Proc. Natl Acad. Set 
U.S.A. 98: 1176-1181, 2001, and Wang et al. y Gene 229: 101-108, 1999). 

This discrepancy suggested that the elevated relative ratio observed in these experiments 
may be driven by low expression levels of these genes in IOSE cells grown in culture. In order to test 
this hypothesis, sqRT-PCR was used to compare ZFP36 (an immediate-early gene also known as 
G0S24 and Tisl 1) expression in tumors to that of normal ovary and IOSE cells. As suspected, 
normal ovary had one of the highest expression levels of ZFP36, followed by that of the majority of 
tumors (Figure 5A), while the lowest expression level was observed in the IOSE cells. 

In addition to statistical analysis, multidimensional scaling (MDS) and hierarchical 
clustering techniques using a correlation metric and average linkage were used for evaluating overall 
gene expression (see Eisen et al, Proc. Natl Acad. Sci. U.SA. 95: 14863-14868, 1998). 



EXAMPLE 3 

Identification of Additional Genes with Altered Expression in Ovarian Cancer 

This example provides a description of how additional disclosed ovarian cancer-related 
nucleic acid molecules were identified. Triese ovarian cancer-related molecules show differences n 
expression in subjects having ovarian cancer compared to expression in normal ovarian surface 
epithelial cells. 



25 Using a different microarrays and methods essentially similar to those described above in 

Example 1, thirty-one ovarian epithelial cancers were compared to two normal postmenopausal 
ovarian samples. 141 additional ovarian cancer-related nucleic acid molecules were identified and 
further characterized (Tables 6 and 7, Addendum). 

30 Methods and Materials: 

Methods and materials were similar to those described in Example 1, except that different 
microarrays were used. The nucleic acids constituted 7,600 features, and representing different (non- 
redundant) transcripts including multiple known named genes and ESTs. The cDNA microarrays 
were constructed by Dr. Eric Chuang (Division of Radiation Oncology) at the Advanced Technology 

35 Center (Gaithersburg, MD 20877). The genes represented on these arrays are composed of 7,600 
cDNA clones and ESTs and are commercially available (Research Genetics, 2130 Memorial 
Parkway, Huntsvillle, AL 35801, U.S). 

The nucleic acid molecule expression patterns of thirty-one ovarian epithelial cancers were 
compared to two normal postmenopausal ovarian samples. The tissues were analyzed once, as the 
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con-elation coefficient from previously repeated array experiments was shown to be 0.92-0.95. Each 
tumor and normal sample was directly compared to a "reference RNA" consisting of a mix of nine 
different human cell lines (Stratagene, La Jolla, CA), allowing for indirect comparison of gene 
expression in tumors and normal ovarian samples. 
5 Hierarchcal clustering was performed as described above and as set forth in Eisen et al, 

Proc. Natl. Acad. Set U.S.A. 95: 14863-8, 1998. 

Results 

Systematically Altered Genes 
10 Using these methods, two additional sets of nucleic acid molecules were identified that 

showed differential expression in subjects having ovarian cancer. Table 4 (see Addendum)provides 
a list of nucleic acid molecules that were found to be underexpressed in subjects having ovarian 
cancer, and their average gene log expression ratios. Table 5 (see Addendum) shows nucleic acid 
molecules that were found to be overexpressed in persons having ovarian cancer, and their average 
1 5 gene log expression ratios. 

Genes underexpressed in ovarian tumors (see Table 4) may represent potential tumor 
suppressors. The induction of the expression of these genes through therapeutic means, for instance 
by induction through drug or gene therapy, may slow tumor growth and/or increase tumor cell death. 
Among the 100 underexpressed genes were several oncogenes coding for proteins that are 
20 normally associated with the process of malignant transformation, 8nciuding JUN (SEQ ID NO: 137- 
138;, KIT (SEQ ID NO: 298-299), and MAF (SEQ ID NO: 229-230). The lower expression of these 
genes in cancers compared to expression in non-cancerous subjects is unexpected and is believed to 
reflect novel effects unique to ovarian cancer. Additionally, CDKN1C(SEQID NO: 249), NBLI 
(SEQ ID NO: 273), and ING1L (SEQ ID NO: 322) are recognized tumor suppressors, the 
25 dowhregulation of which may be involved in the process of tumor formation and/or progression. 
JGFbeta cascade members TGFBR3 (SEQ ID NO: 2 1 6-218) and EBAF (SEQ ID NO: 294) (both 
shown herein to be underexpressed in ovarian cancer) present potential interest in light of the recent 
implication of the TGF beta pathway in normal and oral contraceptive-induced ovarian epithelial cell 
death and turnover (see Rodriguez et al, J. Natl Can. Inst. 94(1): 50-60, 2002). Thus, 
30 downregulation of these nucleic acids may lead to inappropriate growth and possible transformation. 

Genes that were overexpressed in ovarian tumors (Table 5) compared to normal tissue are 
believed to represent suitable targets for therapy and/or diagnosis, prognosis and staging of ovarian 
cancer. The decrease of the expression of these genes through therapeutic means, for instance by 
drug or gene therapy, presents a potential method of inhibition of ovarian cancer. 
35 Among the fifty-nine overexpressed genes were several genes coding for proteins that are 

believed to be particularly promising as gene targets, including the following: SLPI (Secretory 
leukocyte protease inhibitor) (SEQ ID NO: 340-341); SPP1 (Secreted phosphoprotein 1) (SEQ ID 
NO: 342); CKS1 (CDC28 protein kinase 1) (SEQ ID NO: 345-347); ZWINT(ZW 10 interactor) (SEQ 
ID NO: 354); 5^(B-factor, properdin) (SEQ ID NO: 343-344); MMP7 (Matrix metalloproteinase 7) 
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(SEQ ID NO: 348-349); FOLR1 (Folate receptor 1) (SEQ ID NO: 364-365); KLK8 (Kallikrein 8) 
(SEQ ID NO: 368; CRIP1 (Cysteine-rich protein 1) (SEQ ID NO: 375; EYA2 (Eyes absent) (SEQ ID 
NO: 392-393); andPAX8 (Paired box gene 8) (SEQ ED NO: 350-351). 

SLPIis a particularly promising candidate as a potential ovarian cancer marker or detector. 
5 This protein has also been shown to be overexpressed in lung cancer (see Ameshirna et al, Cancer 
89(7): 1448-1456, 2000) and is detectable in the saliva, enabling non-invasive testing (see Shugars et 
al, Gerontology, 47(5): 246-253, 2001). MMP7 over-expression has been described in primary and 
metastatic gastric cancers (see Mori et al, Surgery, 131(1 Pt 2): S39-S47, 2002) as well as colorectal 
carcinomas (see Ougolkov et al, Gastroenterology. 122(1): 60-71, 2002). MMP7 appears to be 
10 involved in new blood vessel formation, which is a prerequisite for tumor growth (see Nishizuka et ' 
al, Cancer Lett. 173(2): 175-182, 2001). SPP1 (otherwise known as osteopondin) has also been 
associated with a number of malignancies (see Fedarko et ah, Clin. Cancer Res. 12: 4060-4066, 
2001) including a recent report showing higher expression in ovarian cancer (see Mok et al, J. Natl. 
Cancer Inst 93(19) 1458-64, 2001). ZWINT is a newly discovered protein involved in kinetochore 
15 binding and centromere function (see Starr et al, J. Cell Sci. 1 13(Pt 1 1): 1939-1950, 2000). 

Properdin is involved in immune function and encodes complement factor B, a component of the 
alternative pathway of complement activation. CRIP1 is believed to be involved in zinc transport. 
Kallikrein 8 (also TADG14) is normally expressed in neural tissue, but appears to be altered such that 
it is highly expressed in ovarian cancers (see Underwood et al, Cancer Res. 59(17): 4435-4439, 
20 1999). EYA2, named for its involvement in eye development, is an important developmental gene 
that is potentially important in ovarian cancer. EYA2 is located on the 20ql3 chromosomal locus, 
which is the most frequently amplified chromosome region in ovarian cancers (see Tanner et al, 
Clin. Cancer Res. 5: 1833-1839, 2000). Other genes localized to the same 20ql3 chromosomal 
region are BMP7, which is also involved in development, and SLPI (discussed above), as well as HE4 
25 (identified in Example 1 , above), all of which show higher expression in ovarian tumors. Thus; the 
upregulation of these nucleic acids may in part be due to amplification of 20ql3 in the tumors 
studied. 

PAX8 is involved in thyroid differentiation and normal function (see Pasca et aL, Proc. Natl 
Acad. Sci. U.S.A. 97(24): 13 144-13 149, 2000). Furthermore, the folate receptor has been shown to be 

30 overexpressed in ovarian cancer (see Hough etal, Cancer Res. 61(10): 3869-3876, 2001 and Bagnoli 
et al, Oncogene, 19(41): 4754-4763, 2000). Finally, the specific pattern of caveolin (CA VI) under- 
expression and Folate receptor (FOLR1) over-expression disclosed herein (see Tables 4 & 5, 
Addendum) is consistent with the reciprocal regulation of the expression of genes in ovarian cancer 
(see Bagnoli et al, Oncogene, 19(41): 4754-4763, 2000). 

35 For each of the above specifically enumerated genes, a survey of the Serial Analysis of Gene 

Expression (SAGE) database (available through the UniGene search engine on the National Center 
for Biotechnology Information website) revealed that the expression of these genes is limited to a 
relatively small number of tissues, including ovarian cancers and some other tumors, for instance 
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pancreatic or breast In addition, SLPI and SPP1 are secreted proteins that may be detectable as i 
diagnostic marker in tbe serum of a subject. 
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EXAMPLE 4 

Classification of a Tumor Into BRCAl-Mnked or BRCA2-\inked tumor class. 

This example describes how to classify a tumor into a BRCAl-tike or BRCA2A&Z tumor 
5 type using compound covariate prediction analysis. 

Class prediction can be performed using a.Compound Covariate Predictor tool, available as 
part of the BRB Array Tools software provided for download on the National Cancer Institute 
Internet website. Detailed information about the Compound Covariate Predictor is provided by the 
1 0 Biometric Research Branch, National Cancer Institute and can be found in the following technical 
reports listed at that site" McShane etaL, 'Methods for assessing reproducibility of clustering 
patterns observed in analyses of microarray data" and Radmacher et a!., "A paradigm for class 
prediction using gene expression profiles." 

The compound covariate predictor tool creates a multivariate predictor for one of two 
15 classes for each sample using markers in the multivariate predictor that are univariately significant at 
the selected significance cutoff for a given set of data (see discussion above in Section V, D, 
"Compound Covariate Predictor Analysis. 1 '). The statistical significance cutoff for a given set of 
data can be chosen based upon the level of confidence desired. 

By way of example, the markers provided in Table 1 0 satisfy a cutoff of PO.0005, and are 
20 therefore suitable for use with compound covariate predictor analysis. The multivariate predictor is a 
weighted linear combination of log-ratios for genes that are univariately significant. The weight 
consists of the univariate t-statistics for comparing the classes. 

Using the compound covariate predictor and the markers provided in Table 10, a sample of 
ovarian tissue can be classified into a2W?G4i-like or BRCA2-\ike tumor. Samples are prepared as 
25 described in Example 1, and logarithmic expression ratios obtained for each marker used in the 
compound covariate predictor analysis. 

The markers provided in Table 10 were used to segregate BRCA /-linked and BRCAI-typ& 
sporadic tumor samples from2?i?G42-linked and BRCA2-typt sporadic samples, in a multivariate 
analysis. Based upon the information regarding these classes that was obtained using other 
30 approaches (such as hierarchical clustering, see Example 1), compound covariate predictor analysis 
classified the tumors with 92% accuracy (see Table 1 1). 

Using this method, an unknown tumor can be classified into one of any two groups provided 
that markers that are univariately significant at the selected significance cutoff for the desired groups 
are known. In addition, the gene expression data for the markers should be obtained using the same 
35 reference standard as the sample tumor. 

Further analysis, such as a "leave-one-out" approach may be employed to check the 
veracity of the compound covariate predictor model. In this approach, each of the tumors is 
individually segregated, and the analysis completed using that tumor against the remaining samples. 
In this way, the strength of the data set is measured against each individual sample (tumor), 
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confirming that the data set is useful, independently of any individual sample. See Radmacher et al, 
"A paradigm for class prediction using gene expression profiles," available on the Biometric 
Research Branch, National Cancer Institute Internet site. 

5 EXAMPLE 5 

Expression of Ovarian Cancer-related Polypeptides 

This example describes how to express the ovarian cancer-related proteins disclosed herein 
using various techniques. 

10 

The disclosed ovarian cancer-related proteins (and fragments thereof) can be expressed by 
standard laboratory technique. After expression, the purified ovarian cancer-related protein or 
polypeptide may be used for instance for functional analyses, antibody production, diagnostics, 
prognostics, and patient therapy, e.g., for prevention or treatment of ovarian cancer. Furthermore, the 

15 DNA sequences encoding the disclosed ovarian cancer-related proteins can be manipulated in studies 
to understand the expression of these genes and the function of their products. Mutant forms of 
human ovarian cancer-related proteins (and corresponding encoding sequences) may be isolated 
based upon information contained herein, and may be studied in order to detect alteration in 
expression patterns in terms of relative quantities, tissue specificity and functional properties of the 

20 encoded mutant ovarian cancer-related protein. Partial or full-length cDNA sequences that encode 
the subject protein may be ligated into bacterial expression vectors. Methods for expressing large 
amounts of protein from a cloned gene introduced into Escherichia coli (E. coli) or other prokaryotes 
may be utilized for the purification, localization, and functional analysis of proteins. For example, 
fusion proteins consisting of amino terminal peptides encoded by a portion of the E. coli lacZot trpE 

25 gene linked to an ovarian cancer-related protein may be used to prepare polyclonal and monoclonal" 
antibodies against these proteins. Thereafter, these antibodies may be used to purity proteins by 
immunoaffinity chromatography, in diagnostic assays to quantitate the levels of protein and to 
localize proteins in tissues and individual cells by immunofluorescence. 

Intact native protein may also be produced in E. coli in large amounts for functional studies. 

30 Methods and plasmid vectors for producing fusion proteins and intact native proteins in bacteria are 
described in Sambrook et al. QnMolecular Cloning: A Laboratory Manual, Ch. 17, CSHL, New 
York, 1 989). Such fusion proteins may be made in large amounts, are easy to purify, and can be used 
to elicit antibody response. Native proteins can be produced in bacteria by placing a strong, regulated 
promoter and an efficient ribosome-binding site upstream of the cloned gene. If low levels of protein 

3 5 are produced, additional steps may be taken to increase protein production; if high levels of protein 
are produced, purification is relatively easy. Suitable methods are presented in Sambrook et al (In 
Molecular Cloning: A Laboratory Manual, CSHL, New York, 1989) and are well known in the art. 
Often, proteins expressed at high levels are found in insoluble inclusion bodies. Methods for 
extracting proteins from these aggregates are described by Sambrook et al (In Molecular Cloning: A 
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Laboratory Manual, Ch. 17, CSHL, New York, 1989). Vector systems suitable for the expression of 
lacZ fusion genes include the pUR series of vectors (see Ruther and Muller-Hill, EMBOl 2:1791, 
1983), pEXl-3 (see Stanley and Luzio, EMBO J. 3: 1429, 1984) and pMRl 00 (see Gray et al, Proc. 
Natl Acad. Set USA 79:6598, 1982). Vectors suitable for the production of intact native proteins 
5 include pKC30 (see Shimatake and Rosenberg, Nature 292:128, 1981), pKK177-3 (see Amann and 
Brosius, Gene 40:183, 1985) and pET-3 (see Studiar and Moffatt, J. Mol Biol 189:1 13, 1986). 
Fusion proteins, for instance fusions that incorporate a portion of an ovarian cancer-related protein, 
may be isolated from protein gels, lyophilized, ground into a powder and used as an antigen. The 
DNA sequence can also be transferred from its existing context to other cloning vehicles, such as 

1 0 other plasmids, bacteriophages, cosmids, animal viruses and yeast artificial chromosomes (YACs) 
(see Burke et al, Science 236:806-812, 1987). These vectors may then be introduced into a variety 
of hosts including somatic cells, and simple or complex organisms, such as bacteria, fungi (see ' 
Timberlake and Marshall, Science 244:1313-1317, 1989), invertebrates, plants (see Gasser and 
Fraley, Science 244:1293, 1989), and animals (seePursel et al, Scfewce 244:1281-1288, 1989), which 

15 cell or organisms are rendered transgenic by the introduction of the heterologous ovarian cancer- 
related cDNA. 

For expression in mammalian cells, the cDNA sequence may be ligated to heterologous 
promoters, such as the simian virus (SV) 40 promoter in the pSV2 vector (see Mulligan and Berg, 
Proc. Natl Acad. Set USA 78:2072-2076, 1981), and introduced into cells, such as monkey COS-1 
20 cells (see Gluzman, Cell 23:175-182, 1981), to achieve transient or long-term expression. The stable 
integration of the chimeric gene construct may be maintained in mammalian cells by biochemical 
selection, for example with neomycin (see Southern and Berg, J. Mol Appl Genet 1: 327-341, 1982) 
or mycophenolic acid (see Mulligan and Berg, Proc. Natl Acad. Set USA 78: 2072-2076, 1981). 
DNA sequences can be manipulated with standard procedures such as restriction enzyme 
25 digestion; fill-in with DNA polymerase, deletion by exonuclease, extension by terminal ' " 

deoxynucleotide transferase, ligation of synthetic or cloned DNA sequences, site-directed sequence- 
alteration via single-stranded bacteriophage intermediate or with the use of specific oligonucleotides 
in combination with PCR. 

The cDNA sequence (or portions derived from it) or a mini gene (a cDNA with an intron 
30 and its own promoter) may be introduced into eukaryotic expression vectors by conventional 

techniques. These vectors are designed to permit the transcription of the cDNA in eukaryotic cells by 
providing regulatory sequences that initiate and enhance the transcription of the cDNA and ensure its 
proper splicing and polyadenylation. Vectors containing the promoter and enhancer regions of the 
SV40 or long terrninal repeat (LTR) of the Rous Sarcoma virus and polyadenylation and splicing 
35 signal from SV40 are readily available (see Mulligan et al, Proc. Natl Acad. Set USA 

78:1078-2076, 1981; Gorman etal, Proc. Natl Acad. Sci USA 78:6777-6781, 1982). The level of 
expression of the cDNA can be rnanipulated with this type of vector, either by using promoters that 
have different activities (for example, the baculovirus pAC373 can express cDNAs at high levels in 
S.frugiperda cells (see Summers and Smith, In Genetically Altered Viruses and the Environment, 
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Fields et al (Eds.) 22:319-328, CSHL Press, Cold Spring Harbor, New York, 1985) or by using 
vectors that contain promoters amenable to modulation, for example, the glucocorticoid-responsive 
promoter from the mouse mammary tumor virus (see Lee et al, Nature 294:228, 1982). The 
expression of the cDNA can be monitored in the recipient cells 24 to 72 hours after introduction 
5 (transient expression). 

In addition, some vectors contain selectable markers such as the gpt (see Mulligan and Berg, 
Proc. Natl Acad. Sci USA 78:2072-2076, 1981) or neo (see Southern and Berg, /. Mol Appl Genet 
1:327-341, 1982) bacterial genes. These selectable markers permit selection of transfected cells that 
exhibit stable, long-term expression of the vectors (and therefore the cDNA). The vectors can be 
1 0 maintained in the cells as episomal, freely replicating entities by using regulatory elements of viruses 
such as papilloma (see Sarver et al, Mol CellBiol 1:486, 1981)orEpstein-Barr(seeSugdene/a/., 
Mol Cell Biol 5:410, 1985). Alternatively, one can also produce cell lines that have integrated the 
vector into genomic DNA. Both of these types of cell lines produce the gene product on a continuous 
basis. One can also produce cell lines that have amplified the number of copies of the vector (and 
15 therefore of the cDNA as well) to create cell lines that can produce high levels of the gene product 
(see Alt et al 9 J. Biol Chem. 253:1357, 1978). 

The transfer of DNA into eukaryotic, in particular human or other mammalian cells, is now 
a conventional technique. The vectors are introduced into the recipient cells as pure DNA 
(transfection) by, for example, precipitation with calcium phosphate (see Graham and vander Eb, 
20 Virology 52:466, 1973) or strontium phosphate (see Brash et al, Mol Cell Biol 7:2013, 1987), 

electroporation (see Neumann et al. EMBO J 1:841, 1982), lipofection (see Feigner et al, Proc. Natl 
Acad. Sci USA 84:7413, 1987), DEAE dextran (see McCuthan et al, J. Natl Cancer Inst. 41:351, 
1968), microinjection (see Mueller et al, Cell 15:579, 1978), protoplast fusion (see Schafiier, Proc. 
Natl Acad. Sci. USA 77:2163-2167 \ 1980), or pellet guns (see Klein et al, Nature 327:70, 1987). 
25 Alternatively, the cDNA, or fragments thereof, can be introduced by infection with virus vectors. 
Systems are developed that use, for example, retroviruses (see Bernstein et al, Gen. Engr'g 7:235, 
1985), adenoviruses (see Ahmad et al, J. Virol. 57:267, 1986), or Herpes virus (see Spaete et al, Cell 
30:295, 1982). MB1 encoding sequences can also be delivered to target cells in vitro via non- 
infectious systems, for instance liposomes. 
30 These eukaryotic expression systems can be used for studies of ovarian cancer-related 

nucleic acids (such as those listed in Table 1) and mutant forms of these molecules, as well as ovarian 
cancer-related proteins and mutant forms of these protein. Such uses include, for example, the 
identification of regulatory elements located in the 5' region of ovarian cancer-related genes on 
genomic clones that can be isolated from human genomic DNA libraries. The eukaryotic expression 
35 systems may also be used to study the function of the normal ovarian cancer-related proteins, specific 
portions of these proteins, or of naturally occurring or artificially produced mutant versions of 
ovarian cancer-related proteins. 

Using the above- techniques, the expression vectors containing ovarian cancer-related gene 
sequence or cDNA, or fragments or variants or mutants thereof; can be introduced into human cells, 
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mammalian cells from other species or non-mammalian cells as desired. The choice of cell is 
determined by the purpose of the treatment. For example, monkey COS cells (see Gluzman, Cell 
23:175-182, 1981) that produce high levels of the SV40 T antigen and permit the replication of 
vectors containing the SV40 origin of replication may be used. Similarly, Chinese hamster ovary 
5 (CHO), mouse NIH 3T3 fibroblasts or human fibroblasts or lymphoblasts may be used. 

The present disclosure thus encompasses recombinant vectors that comprise all or part of an 
ovarian cancer-related gene or cDNA sequence (e.g., those listed in Table 1), for expression in a 
suitable host. In some embodiments, the ovarian cancer-related nucleic acid sequence is operatively 
linked in the vector to an expression control sequence to form a recombinant DNA molecule, so that 

10 the ovarian cancer-related polypeptide can be expressed. The expression control sequence may be 
selected from the group consisting of sequences that control the expression of genes of prokaryotic or 
eukaryotic cells and their viruses, and combinations thereof. The expression control sequence may be 
specifically selected from the group consisting of the lac system, the trp system, the tac system, the 
trc system, major operator and promoter regions of phage lambda, the control region of fd coat 

1 5 protein, the early and late promoters of SV40, promoters derived from polyoma, adenovirus, 

retrovirus, baculovirus and simian virus, the promoter for 3-phosphoglycerate kinase, the promoters 
of yeast acid phosphatase, the promoter of the yeast alpha-mating factors, and combinations thereof. 

The host cell, which may be transfected with the vector of this disclosure, may be selected 
from the group consisting of E. coli, Pseudomonas, Bacillus subtilis, B. stearothermophilus or other 

20 bacilli; other bacteria; yeast; fungi; insect; mouse or other animal; or plant hosts; or human tissue 
cells. 

It is appreciated that for mutant or variant ovarian cancer-related DNA sequences, similar 
systems are employed to express and produce the mutant product. In addition, fragments of an 
ovarian cancer-related protein can be expressed essentially as detailed above. Such fragments include 
25 individual ovarian cancer-related protein domains or sub-domains, as well as shorter fragments such 
as peptides. Ovarian cancer-related protein fragments (<?.g., those having therapeutic properties) may 
be expressed in this manner also. 

EXAMPLE 6 

30 Suppression of Ovarian Cancer-related Increased Gene Expression 

This example describes how the ovarian cancer-related nucleic acids disclosed herein may 
be suppressed using various techniques. 

35 A reduction of ovarian cancer-related protein expression in a transgenic cell may be obtained 

by introducing into cells an antisense construct based on an ovarian cancer-related protein encoding 
sequence, such as a cDNA or gene sequence or flanking regions thereof of any one of the proteins 
encoded by the nucleic acid molecules listed in Table 1, Table 9 or elsewhere herein. For antisense 
suppression, a nucleotide sequence encoding an ovarian cancer-related protein that is overexpressed 
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in ovarian cancer, e.g. all or a portion of the small cell lung carcinoma cluster 4 antigen (CD24) (SEQ 
ID NO: 181-182), secretory leukocyte protease inhibitor antileukoproteinase (SLPI) (SEQ ID NO: 
340-341), secreted phosphoprotein 1 (SPP1) (SEQ ID NO: 342), B-factor, properdin (BF) (SEQ ED 
NO: 343-344), "homolog of Cksl=p34Cdc28/Cdc2-associated protein" (CKS1) (SEQ ID NO: 345- 
5 347), matrix metalloproteinase 7 (MMP7) (SEQ ID NO: 348-349), paired box gene 8 (PAX8) (SEQ 
ID NO: 350-351), serine protease inhibitor, Kunitz type, 2 (SPINT2) (SEQ ID NO: 352-353), ZW10 
interactor (ZWINT) (SEQ ID NO: 354), diacylglycerol kinase {DGKH) (SEQ ID NO: 355), high- 
mobility group (nonhistone chromosomal) protein isoforms I and Y (HMGIY) (SEQ ID NO: 356), 
Syndecan-4 - amphiglycan - ryudocan core protein (SDC4) (SEQ ID NO: 357-359), cyclin-dependent 
10 kinase inhibitor 2A (CDKN2A) (SEQ ID NO: 360), sodium channel, nonvoltage-gated 1 alpha 

(SCNN1A) (SEQ ID NO: 361-362), lactate dehydrogenase A (LDHA) (SEQ ID NO: 363), adult folate 
receptor (FOLR1) (SEQ ID NO: 364-365), Triosephosphate isomerase 1 (TPU) (SEQ ID NO: 366- 
367), kallikrein 8 (neuropsin/ovasin) (KLK8) (SEQ ID NO: 368), CXC chemokine receptor 4- fusin- 
neuropeptide Y receptor-L3 (CXCR4) (SEQ ED NO: 200), kinesin-like 1 (KNSL1) (SEQ ID NO: 369- 
15 370), H2A histone family, member O (H2AFO) (SEQ ID NO: 371-372), major histocompatibility 
complex, class II, DR alpha, HLA-DRA, cysteine-rich protein 1 (intestinal) (CRJP1) (SEQ ID NO: 
375), pyrophosphatase (inorganic), (PP) (SEQ ID NO: 376), EST 666391, glucose transporter 
(HepG2) (SLC2A1) (SEQ ID NO: 379-381), EST 897770, hepatoma-derived growth factor (HDGF) 
(SEQ ED NO: 383-385), argininosuccinate synthetase (ASS) (SEQ ID NO: 386), claudin 4 (CLDN4) 
20 (SEQ ID NO: 387-388), preferentially expressed antigen in melanoma (FRAME) (SEQ ID NO: 389), 
LAR = LCA-homologue (PTPRF) (SEQ ID NO: 390-391), eyes absent (Drosophila) homolog 2 
(EYA2) (SEQ ID NO: 392-393), L-myc (MYCL1) (SEQ ID NO: 394-396), STAT1-IFN alpha/beta- 
responsive transcription factor ISGF3 beta subunits (p91/p84) (STAT1) (SEQ ID NO: 397-399), 
mitochondrial carrier homolog 2 (MTCH2) (SEQ ID NO: 400-401), 5-hycfroxyti7ptamine (serotonin) 
25 receptor 3A (HTR3A) (SEQ ID NO: 402), cyclinEl (CCNEJ) (SEQ ID NO: 403-404), cadherin 6, 
type 2, K-cadherin (fetal kidney) (CDH6) (SEQ ID NO: 405), 5'-AMP-activated protein kinase - 
gamma-1 subunit (PRKAG1) (SEQ ID NO: 406-408), defensin beta 1 (DEFB1) (SEQ ID NO: 409), 
actin related protein 2/3 complex, subunit 1 A (41 kD) (ARPC1E) (SEQ ID NO: 410-41 1), PKC 
iota=Protein kinase C, iota (PRKCI) (SEQ ID NO: 412-414), glyceraldehyde-3-phosphate 
30 dehydrogenase (GAPD) (SEQ ID NO: 415), complement component 2 (C2) (SEQ ID NO: 416-417), 
H2A histone family, member Y (H2AFY) (SEQ ID NO: 418-419), transmembrane 4 superfamily 
member 1 (TM4SF1) (SEQ ID NO: 420-421), glyceraldehyde-3-phosphate dehydrogenase (GAPD) 
(SEQ ED NO: 422-423), Interferon-inducible protein 1-8U (IFITM3) (SEQ ID NO: 424-426), glycine 
dehydrogenase (decarboxylating; glycine decarboxylase, glycine cleavage system protein P) (GLDQ 
35 (SEQ ID NO: 427-428), calumenin (CALU) (SEQ ID NO: 429-430), hemoglobin alpha 2 (HBA2) 
(SEQ ID NO: 431-432), S100 calcium-binding protein All (calgizzarin) (S100AU) (SEQ ID NO: 
433), Lactate dehydrogenase A (LDHA) (SEQ ID NO: 434-436), ubiquitin-conjugating enzyme E2C 
(UBE2Q (SEQ ID NO: 437), E2F-3=pRB-binding transcription factor=KIAA0075 (E2F3) (SEQ ED 
NO: 438-440), E-cadherin (CDH1) (SEQ ED NO: 441-442), proteasome (prosome, macropain) 
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activator subunit 2 (PA28 beta) (PSME2) (SEQ ID NO: 443-444), OP-l=osteogenic protein in the 
TGF-beta family (BMP7) (SEQ ED NO: 445-447), and topoisomerase E (TOP2A) (SEQ ID NO: 448) 
cDNA or gene, is arranged in reverse orientation relative to the promoter sequence in the 
transformation vector. Other aspects of the vector may be chosen as for any other expression vector 
5 (see, Example 4). 

The introduced sequence need not be a full-length human ovarian cancer-related cDNA or 
gene, and need not be exactly homologous to the equivalent sequence found in the cell type to be 
transformed. Generally, however, where the introduced sequence is of shorter length, a higher degree 
of homology to the ovarian cancer-related sequence likely will be needed for effective antisense 
10 suppression. The introduced antisense sequence in the vector may be at least thirty nucleotides in 
length, and improved antisense suppression will typically be observed as the length of the antisense 
sequence increases. The length of the antisense sequence in the vector advantageously may be 
greater than 100 nucleotides. 

Although the exact mechanism by which antisense RNA molecules interfere with gene 
15 expression has not been elucidated, it is believed that antisense RNA molecules bind to the 
endogenous mRNA molecules and thereby inhibit translation of the endogenous rnRNA. 

Suppression of endogenous ovarian cancer-related gene expression can also be achieved 
using ribozymes. Ribozymes are synthetic RNA molecules that possess highly specific 
endoribonuclease activity. The production and use of ribozymes are disclosed in U.S. Patent No. 
20 4,987,071 to Cech and U.S. Patent No. 5,543,508 to HaselhorT. The inclusion of ribozyme sequences 
within antisense RNAs may be used to confer RNA cleaving activity on the antisense RNA, such that 
endogenous mRNA molecules that bind to the antisense RNA are cleaved, which in turn leads to an 
enhanced antisense inhibition of endogenous gene expression. 

In addition, dominant negative mutant forms of the disclosed ovarian cancer-related sequences 
25 may be used to block endogenous activity of the corresponding gene products. 

Suppression can also be achieved using small inhibitory RNA molecules (siRNAs) (see, for 
instance, Caplen et aL, Proc. Natl Acad Sci. 98(17): 9742-9747, 2001, and Elbashir et al y Nature 
41 1: 494-498, 2001). Thus, this disclosure also encompasses siRNAs that correspond to an ovarian 
cancer-related nucleic acid, which siRNA is capable of suppressing the expression or function of its 
30 cognate (target) ovarian cancer-related protein. Also encompassed are methods of suppressing the 
expression or activity of an ovarian cancer-related molecule using an siRNA. 

Suppression of expression of an ovarian cancer-related gene can be used, for instance, to treat, 
reduce, or prevent cell proliferative and other disorders caused by over-expression or unregulated 
expression of the corresponding ovarian cancer-related gene. In particular, suppression of expression 
35 of sequences disclosed herein as being up-regulated in ovarian cancer can be used to treat, reduce, or 
prevent progression to a later stage of ovarian cancer. 
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EXAMPLE 7 
Nucleic Acid-Based Analysis 



This example describes how to use the ovarian cancer-related nucleic acids disclosed herein 
5 to detect and analyze neoplasms and mutations in ovarian cancer-related nucleic acids that may result 
in neoplasms. 



The ovarian cancer-related nucleic acid molecules provided herein, and combinations of 
these molecules, can be used in methods of genetic testing for neoplasms (e.g., ovarian or other 

1 0 cancers) or predisposition to neoplasms owing to altered expression of ovarian cancer-related nucleic 
acid molecules (e.g., deletion, genomic amplification or mutation, or over- or under-expression in 
comparison to a control or baseline). For such procedures, a biological sample of the subject, which 
biological sample contains either DNA or RNA derived from the subject, is assayed for a mutated, 
amplified or deleted ovarian cancer-related nucleic acid molecule, or for over- or under-expression of 

15 an ovarian cancer-related nucleic acid molecule. Suitable biological samples include samples 

containing genomic DNA or RNA (including mRNA), obtained from body cells of a subject, such as 
those present in peripheral blood, urine, saliva, tissue biopsy, surgical specimen, amniocentesis 
samples and autopsy material. 

The detection in the biological sample of a mutant ovarian cancer-related nucleic acid 

20 molecule, a mutant ovarian cancer-related RNA, an amplified or homozygously or heterozygously 
deleted ovarian cancer-related nucleic acid molecule, or over- or under-expression of an ovarian 
cancer-related nucleic acid molecule, may be performed by a number of methodologies, examples of 
which are provided. 

25 A. Detection of Unknown Mutations: 

Unknown mutations in ovarian cancer-related nucleic acid molecules can be identified 
through polymerase chain reaction amplification of reverse transcribed RNA (RT-PCR) or DNA 
isolated from breast or ovary or other tissue, followed by direct DNA sequence determination of the 
products; single-strand conformational polymorphism analysis (SSCP) (for instance, see Hongyo et 

30 al, Nucleic Acids Res. 21:3637-3642, 1993); chemical cleavage (including HOT cleavage) (Bateman 
et al, Am. 1 Med. Genet. 45:233-240, 1993; reviewed in Ellis et al, Hum. Mutat. 11:345-353, 1998); 
denaturing gradient gel electrophoresis (DGGE); ligation amplification mismatch protection 
(LAMP); or enzymatic mutation scanning (Taylor and Deeble, Genet Anal 14:181-186, 1999), 
followed by direct sequencing of amplicons with putative sequence variations. 

35 

B. Detection of Known Mutations: 

The detection of specific known DNA mutations in ovarian cancer-related nucleic acid 
molecules may be achieved by methods such as hybridization using allele specific oligonucleotides 
(ASOs) (see Wallace et al, CSHL Symp. Quant. Biol 51:257-261, 1986), direct DNA sequencing 
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(see Church and Gilbert, Proc. Nail. Acad. Sci. USA 81:1991-1995, 1988), the use of restriction 
enzymes (see Flaveller a/., Cell 15:25, 1978;Geever etal., Proc. Nail. Acad. Sci. U.S.A. 8(8): 5081- 
5085, 1981), discrimination on the basis of electrophoretic mobility in gels with denaturing reagent 
(see Myers and Maniatis, Cold Spring Harbor5yff//7. Quant. Biol. 51:275-284, 1986),RNase 
protection (see Myers et ah, Science 230:1242, 1985), chemical cleavage (see Cotton et al, Proc. 
Natl. Acad. Sci. USA 85:4397-4401, 1985), and the ligase-mediated detection procedure (see 
Landegren et al.. Science 241:1077-1080, 1988). Oligonucleotides specific to normal or mutant MB1 
sequences can be chemically synthesized using commercially available machines. These 
oligonucleotides can then be labeled radioactively with isotopes (such as 32 P) or non-radioactively, 
with tags such as biotin (see Ward and Langer et al., Proc. Natl. Acad. Sci. USA 78:6633-6657, 
1981), and hybridized to individual DNA samples immobilized on membranes or other solid supports 
by dot-blot or transfer from gels after electrophoresis. These specific sequences are visualized by 
methods such as autoradiography or fluorometric (see Landegren et al.. Science 242:229-237, 1989) 
orcolormretricreactioiis(seeGebeyehuera/.,^c/eic^c/V/i/{ej. 15:4513-4534, 1987). Using an 
15 ASO specific for a normal allele, the absence of hybridization would indicate a mutation in the 

particular region of the gene, or deleted MB1 gene. In contrast, if an ASO specific for a mutant allele 
hybridizes to a clinical sample then that would indicate the presence of a mutation in the region 
defined by the ASO. 

20 C. Detection of Genomic Amplification or Deletion 

Gene dosage (copy number) can be important in neoplasms; it is therefore advantageous to 
determine the number of copies of ovarian cancer-related nucleic acids in biological samples of a 
subject, e.g., serum or ovary samples. Probes generated from the disclosed encoding sequence of in 
ovarian cancer-related nucleic acid molecules can be used to investigate and measure genomic dosage 

25 of the corresponding ovarian cancer-related genomic sequence. ' 

Appropriate techniques for measuring gene dosage are known in the art; see for instance, US 
Patent No. 5,569,753 ("Cancer Detection Probes") and Pinkel et al. (Nat. Genet. 20:207-21 1, 1998) 
("High Resolution Analysis of DNA Copy Number Variation using Comparative Genomic 
Hybridization to Microarrays"). 
30 Determination of gene copy number in cells of a patient-derived sample using other 

techniques is known in the art. For example, amplification of an ovarian cancer-related nucleic acid 
sequence in cancer-derived cell lines as well as uncultured ovarian cancer or other cells can be carried 
out using bicolor FISH analysis. By way of example, interphase FISH analysis of breast cancer cell 
lines can be carried out as previously described (see Barlund et al, Genes Chromo. Cancer 20:372- 
35 376, 1997). The hybridizations can be evaluated using a Zeiss fluorescence microscope. 

For tissue microanays, the FISH can be performed as described in Kononen et al {Nat 
Med. 4:844-847, 1998). Briefly, consecutive sections of the array are deparaffinized, dehydrated in 
ethanol, denatured at 74° C for 5 minutes in 70% formamide/2 x SSC, and hybridized with test and 
reference probes. The specimens containing tight clusters of signals or >3-foId increase in the 
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number of test probe as compared to chromosome 17 centromere in at least 10% of the tumor cells 
maybe considered as amplified. Microarrays can be constructed as described in WO 99/44063A2 
and WO 99/44062A1. 

5 C. Detection of mRNA Expression Levels 

Altered expression of an ovarian cancer-related molecule also can be detected by measuring 
the cellular level of ovarian cancer-related nucleic acid molecule-specific mRNA. mRNA can be 
measured using techniques well known in the art, including for instance Northern analysis, RT-PCR 
and mRNA in situ hybridization. Details of mRNA analysis procedures can be found, for instance, in 
10 Example 1, Example 3, and Sambrook et al (ed), Molecular Cloning: A Laboratory Manual, 2nd 
ed., vol. 1-3, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

The nucleic acid-based diagnostic methods of this disclosure are predictive of ovarian 
cancer. Cells of any tumors that demonstrate altered expression levels {e.g., through genomic 
amplification, deletion, mutation, or other over- or under-expression) of nucleotide sequences that 
1 5 share homology with the ovarian cancer-related nucleic acids disclosed herein are aggressive tumor 
cells, and result in decreased survival, increased metastasis, increased rates of clinical, and overall 
worsened prognosis. 

EXAMPLE 8 

20 Production of Protein Specific Binding Agents 

This example describes how to use the ovarian cancer-related molecules disclosed herein to 
produce binding agents useful in preventing ovarian cancer. 

25 Monoclonal or polyclonal antibodies may be produced to any of the disclosed ovarian 

cancer-related proteins, or mutant forms of these proteins. Optimally, antibodies raised against these 
proteins, or peptides from within such proteins, would specifically detect the protein or peptide with 
which the antibodies are generated. That is, an antibody generated to the BMP7 protein or another 
specified protein (see Table 1) or a fragment thereof would recognize and bind that protein and would 

30 not substantially recognize or bind to other proteins found in human cells. 

The determination that an antibody specifically detects a designated protein (e.g., an ovarian 
cancer-related protein as disclosed herein) can be made by any one of a number of standard 
immunoassay methods; for instance, the Western blotting technique (see Sambrook et al, In 
Molecular Cloning: A Laboratory Manual, CSHL, New York, 1989). To determine that a given 

35 antibody preparation (such as one produced in a mouse) specifically detects a designated protein by 
Western blotting, total cellular proteins are extracted from cells (for example, human ovary) and 
electrophoresed on a sodium dodecyl sulfate-polyacrylamide gel. The proteins are then transferred to 
a membrane (for example, nitrocellulose) by Western blotting, and the antibody preparation is 
incubated with the membrane. After washing the membrane to remove non-specifically bound 
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antibodies, the presence of specifically bound antibodies is detected by the use of an anti-mouse 
antibody conjugated to an enzyme such as alkaline phosphatase. Application of an alkaline 
phosphatase substrate 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium results in the 
production of a dense blue compound by immunolocalized alkaline phosphatase. Antibodies that 
5 specifically detect the designated protein will, by this technique, be shown to bind to the designated 
protein band (which will be localized at a given position on the gel determined by its molecular 
weight). Non-specific binding of the antibody to other proteins may occur and may be detectable as a 
weak signal on the Western blot. The non-specific nature of this binding will be recognized by one 
skilled in the art by the weak signal obtained on the Western blot relative to the strong primary signal 

1 0 arising from the specific antibody-protein binding. 

Substantially pure ovarian cancer-related protein or protein fragment (peptide) suitable for 
use as an immunogen may be isolated from transfected or transformed cells, as described above. 
Concentration of protein or peptide in the final preparation is adjusted, for example, by concentration 
on an Amicon filter device, to the level of a few micrograms per milliliter. Monoclonal or polyclonal 

1 5 antibody to the protein can then be prepared as follows: 

A. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of a designated protein (such as an ovarian cancer-related 
protein, including one encoded by a nucleic acid listed in Table 1) identified and isolated as described 

20 can be prepared from murine hybridomas according to the classical method of Kohler and Milstein 
{Nature 256:495-497, 1975) or derivative methods thereof. Briefly, a mouse is repetitively 
inoculated with a few micrograms of the selected protein over a period of a few weeks. The mouse is 
then sacrificed, and the antibody-producing cells of the spleen isolated. The spleen cells are fused by 
means of polyethylene glycol with mouse myeloma cells, and the excess un-fused cells destroyed by 

25 growth of the system on selective media comprising aminopterin (HAT media). The successfully ' " 
fused cells are diluted and aliquots of the dilution placed in wells of a microliter plate where growth 
of the culture is continued. Antibody-producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described by 
Engvall (Meth. Enzymol 70; 419-439, 1980)^ and derivative methods thereof. Selected positive 

30 clones can be expanded and their monoclonal antibody product harvested for use. Detailed 

procedures for monoclonal antibody production are described in Harlow and Lane (Antibodies, A 
Laboratory Manual, CSHL, New York, 1988). 

B. Polyclonal Antibody Production by Immunization 
35 Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein can 

be prepared by irnmunizing suitable animals with the expressed protein, which can be unmodified or 
modified to enhance imnTunogenicity. Effective polyclonal antibody production is affected by many 
factors related both to the antigen and the host species. For example, small molecules tend to be less 
immunogenic than others and may require the use of carriers and adjuvant Also, host animals vary 
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inresponse to site of inoculations and dose, with either inadequate or excessive doses of antigen 
resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple intradermal 
srtes appear to be most reliable. An effective immunization protocol for rabbits can be found in 
Vaitukaitis etal. (J. Clin. Endocrinol. Metab. 33: 988-991, 1971). 

Booster injections can be givenat regular intervals, and antiserum harvested when antibody 
titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar 
agamst known concentrations of the antigen, begins to fell. See, for example, Ouchterlony et al (In 
Handbook of Experimental Immunology, Wicr(ed.) Chapter 19. BlackweU, 1973). Plateau 
concentration of antibody is usually in the range of about 0.1 to 0.2 mg/ml of serum (about 12 pM) 
Affimty of the antisera for the antigen is determined by preparing competitive binding curves, as 
described, for example, by Fisher (Manual of Clinical Immunology, Ch. 42, 1980). 



C. Antibodies Raised against Synthetic Peptides 

A third approach to raising antibodies against the subject ovarian cancer-related proteins or 
15 peptides is to use one or more synthetic peptides synthesized on a commercially available peptide 
synthesizer based upon the predicted amino acid sequence of the desired ovarian cancer-related 
protein or peptide. 



20 



25 



30 



35 



D. Antibodies Raised by Injection of Protein Encoding Sequence 

Antibodies also may be raised against proteins and peptides related to ovarian cancer as 
described herein by subcutaneous injection of a DNA vector that expresses the desired ovarian 
cancer-related protein, or a fragment thereof, into laboratory animals, such as mice. Delivery of the 
recombinant vector into the animals may be achieved using a hand-held form of the Biolistic system 
(see Sanford etal., Particulate Sci. Technol. 5:27-37, 1987) as described by Tang etal. (Nature 
. 356:152-154, 1992). Expression vectors suitable for this purpose may include those that express the 
ovarian cancer-related sequence under the transcriptional control of either the human p-actin 
promoter or the cytomegalovirus (CMV) promoter. 

Antibody preparations prepared according to these protocols are useful in quantitative 
immunoassays that determine concentrations of antigen-bearing substances in biological samples; 
they also can be used semi-quantitatively or qualitatively to identify the presence of antigen in a ' 
biological sample; or for immunolocalization of the corresponding ovarian cancer-related protein. 

For admimstration to human patients, antibodies, e.g., ovarian cancer-related protein specific 
monoclonal antibodies (such as antibodies to the proteins encoded by the encoding sequences listed 
toinTable 1), can be humanized by methods known in the art. Antibodies with a desired binding 
specificity can be commercially humanized (Scotgene, Scotland, UK; Oxford Molecular, Palo Alto, 
CA). Alternatively, human antibodies can be produced. Methods for producing human antibodies ' 
are known in the art; see, for instance, Canevari etal, bit J. Biol. Markers 8:147-150, 1993 and 
Green, J. Immunol. Meth. 231:11-23, 1999, for instance. 
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EXAMPLE 9 
Protein-Based Analysis 

This example describes how to use the ovarian cancer-related molecules disclosed herein to 
5 quantitate the level of one or more ovarian cancer-related proteins in a subject. 

An alternative method of diagnosing, staging, detecting, or predicting ovarian cancer is to 
quantitate the level of one or more ovarian cancer-related proteins in a subject, for instance in the 
cells of the subject. This diagnostic tool is useful for detecting reduced or increased levels of ovarian 
1 0 cancer-related proteins. Localization and/or coordinated expression (temporally or spatially) of 
ovarian cancer-related proteins can also be examined using well known techniques. The 
determination of reduced or increased ovarian cancer-related protein levels, in comparison to such 
expression in a normal subject (eg., a subject not having ovarian cancer or not having a 
predisposition developing this condition, disease or disorder, would be an alternative or supplemental 
1 5 approach to the direct determination of ovarian cancer-related nucleic acid levels by the methods 
outlined above and equivalents. The availability of antibodies specific to specific ovarian cancer- 
related protein(s) will facilitate the detection and quantitation of cellular ovarian cancer-related 
protein(s) by one of a number of immunoassay methods which are well known in the art and are 
presented in Harlow and Lane (Antibodies, A Laboratory Manual, CSHL, New York, 1988). 
20 Methods of constructing such antibodies are discussed above, in Example 7. 

Any standard immunoassay format (eg., ELISA, Western blot, or RIA assay) can be used to 
measure altered expression of ovarian cancer-related polypeptide or protein levels; comparison is to 
wild-type (normal) ovarian cancer-related protein levels, and a difference in ovarian cancer-related 
polypeptide levels is indicative of a biological condition resulting from altered expression of ovarian 
25 cancer-related polypeptides or proteins, such as neoplasia* Whether the key difference is an increase 
or a decrease is dependent on the specific ovarian cancer-related protein under examination, as 
discussed herein. Immunohistochemical techniques may also be utilized for ovarian cancer-related 
polypeptide or protein detection and quantification. For example, a tissue sample maybe obtained 
from a subject, and a section stained for the presence of an ovarian cancer-related protein using the 
30 appropriate ovarian cancer-related protein specific binding agent and any standard detection system 
(eg., one which includes a secondary antibody conjugated to horseradish peroxidase). General 
guidance regarding such techniques can be found in, eg, Bancroft and Stevens (Theory and Practice 
of Histological Techniques, Churchill Livingstone, 1982) and Ausubel etal (Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1998). 
35 For the purposes of quantitating an ovarian cancer-related protein, a biological sample of the 

subject, which sample includes cellular proteins, is required. Such a biological sample may be 
obtained from body cells, such as those present in peripheral blood, urine, saliva, tissue biopsy, 
amniocentesis samples, surgical specimens and autopsy material, particularly breast cells. 
Quantitation of an ovarian cancer-related protein can be achieved by immunoassay and the amount 
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compared to levels of the protein found in healthy cells. A significant difference (either increase or 
decrease) in the amount of ovarian cancer-related protein in the cells of a subject compared to the 
amount of the same ovarian cancer-related protein found in normal human cells is usually about a 
10% or greater change, for instance 20%, 30%, 40%, 50% or greater difference. Substantial under- 
or over-expression of one or more ovarian cancer-related protein(s), may be indicative of neoplasia c 
a predilection to neoplasia or metastasis, and especially ovarian epithelial cancer. 

The protein-based diagnostic methods as described herein are predictive of ovarian cancer. 
Cells of any tumors that demonstrate altered expression levels (e.g., through genomic amplification, 
deletion, mutation, or other over- or under-expression) of nucleotide sequences that share homology 
with the ovarian cancer-related nucleic acids disclosed herein are aggressive tumor cells, and result ii 
decreased survival, increased metastasis, increased rates of clinical recurrence, and overall worsened 
prognosis. 



EXAMTLE 10 
Gene Therapy 

This example describes how to use the ovarian cancer-related molecules and analysis 
methods disclosed herein to effect gene therapy for the treatment of ovarian cancer. 



20 Gene therapy approaches for combating neoplasia (particularly ovarian cancer) in subjects 

are made possible by the present disclosure. 

Retroviruses have been considered a preferred vector for experiments in gene therapy, with a 
high efficiency of infection and stable integration and expression (see Orkin et aL, Prog. Med. Genet 
7:130-142, 1988). A full-length ovarian cancer-related gene or cDNA can be cloned into a retroviral 
25 vector and driven from either its endogenous promoter or from the retroviral LTR (long terminal 
repeat). Other viral transfection systems may also be utilized for this type of approach, including 
adenovirus, adeno-associated virus (AAV) (see McLaughlin et aL, J. Virol. 62:1963-1973, 1988), 
Vaccinia virus (Moss et al } Annu. Rev. Immunol. 5:305-324, 1987), Bovine Papilloma virus 
(Rasmussen et aL, Methods Enzymol 139:642-654, 1987) or members of the herpesvirus group such 
30 as Epstein-Barr virus (Margolskee et al, Mol. Cell. Biol. 8:2837-2847, 1988). 

Developments in gene therapy techniques include the use of KNA-DNA hybrid 
oligonucleotides, as described by Cole-Strauss et al. (Science 273:1386-1389, 1996). This technique 
may allow for site-specific integration of cloned sequences, thereby permitting accurately targeted 
gene replacement. 

35 In addition to delivery of ovarian cancer-related protein encoding sequences to cells using 

viral vectors, it is possible to use non-infectious methods of delivery. For instance, lipidic and 
Hposome-mediated gene delivery has recently been used successfully for transfection with various 
genes (for reviews, see Templeton and Lasic, MoL Biotechnol 11:175-180, 1999; Lee and Huang, 
Crit. Rev. Ther. Drug Carrier Syst. 14:173-206; and Cooper, Semin. Oncol 23:172-187, 1996). For 
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instance, cationic liposomes have been analyzed for their ability to transfect monocytic leukemia 
cells, and shown to be a viable alternative to using viral vectors (de Lima et a/., Mol Membr. Biol 
16:103-109, 1999). Such cationic liposomes can also be targeted to specific cells through the 
inclusion of, for instance, monoclonal antibodies or other appropriate targeting ligands (see Kao et 
5 al, Cancer Gene Ther. 3:250-256, 1996). 

To reduce the level of ovarian cancer-related gene expression, gene therapy can be carried 
out using antisense or other suppressive constructs, the construction of which is discussed above 
(Example 4). 

10 EXAMPLE 11 

Kits 

This example describes various kits for using the ovarian cancer-related molecules and 
analysis methods disclosed herein. 

15 

Kits are provided to determine the level (or relative level) of expression of one or more 
species of ovarian cancer-related nucleic acids (e.g. } mRNA) or one or more ovarian cancer-related 
protein (Le., kits containing nucleic acid probes or antibodies or other ovarian cancer-related protein 
specific binding agents). Kits are also provided that contain the necessary reagents for detennining 
20 gene copy number (genomic amplification or deletion), such as probes or primers specific for an 
ovarian cancer-related nucleic acid sequence. These kits can each include instructions, for instance 
instructions that provide calibration curves or charts to compare with the determined (e.g., 
experimentally measured) values. 

25 A. Kits for Detection of Ovarian Cancer-related Genomic 

Amplification or Deletion 

The nucleotide sequence of ovarian cancer-related nucleic acid molecules disclosed herein, 
and fragments thereof, can be supplied in the form of a kit for use in detection of ovarian cancer- 
related genomic amplification/deletion and/or diagnosis of progression to or predilection to progress 

30 to ovarian epithelial cancer. In such a kit, an appropriate amount of one or more oligonucleotide 
primer specific for an ovarian cancer-related-sequence is provided in one or more containers. The 
oligonucleotide primers may be provided suspended in an aqueous solution or. as a rxeeze-dried or 
iyophilized powder, for instance. The containers) in which the oligonucleotide(s) are supplied can 
be any conventional container that is capable of holding the supplied form, for instance, microfuge 

35 tubes, ampoules, or bottles. In some applications, pairs of primers may be provided in pre-measured 
single use amounts in individual, typically disposable, tubes, or equivalent containers. With such an 
arrangement, the sample to be tested for the presence of ovarian cancer-related genomic 
amplification/deletion can be added to the individual tubes and in vitro amplification carried out 
directly. 
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The amount of each oligonucleotide primer supplied in the kit can be any amount, 
depending for instance on the market to which the product is directed. For instance, if the kit is 
adapted for research or clinical use, the amount of each oligonucleotide primer provided likely would 
be an amount sufficient to prime several in vitro amplification reactions. Those of ordinary skill in 
5 the art know the amount of oligonucleotide primer that is appropriate for use in a single amplification 
reaction. General guidelines may for instance be found in Innis et al (PCR Protocols, A Guide to 
Methods and Applications, Academic Press, Inc., San Diego, CA, 1990), Sambrook et al (In 
Molecular Cloning; A Laboratory Manual, Cold Spring Harbor, New York, 1989), and Ausubel et 
al (In Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 
10 A kit may include more than two primers, in order to facilitate the in vitro amplification of 

ovarian cancer-related genomic sequences (or a protein of such a sequence), for instance an ovarian 
cancer-related nucleic acid listed in Table 1, or the 5' or 3* flanking region thereof. 

In some embodiments, kits may also include the reagents necessary to carry out in vitro 
amplification reactions, including, for instance, DNA sample preparation reagents, appropriate 
15 buffers (e.g., polymerase buffer), salts (e.g, magnesium chloride), and deoxyribonucleotides 
(dNTPs). Written instructions may also be included. 

Kits may in addition include either labeled or unlabeled oligonucleotide probes for use in 
detection of the in vitro amplified sequences. The appropriate sequences for such a probe will be any 
sequence that falls between the annealing sites of two provided oligonucleotide primers, such that the 
20 sequence the probe is complementary to is amplified during the in vitro amplification reaction (if it is 
present in the sample). 

It may also be advantageous to provided in the kit one or more control sequences for use in 
the in vitro amplification reactions. The design of appropriate positive control sequences is well 
known to one of ordinary skill in the appropriate art. 

25 

B. Kits for Detection of mRNA Expression 

Kits similar to those disclosed above for the detection of ovarian cancer-related genomic 
amphfication/deletion can be used to detect ovarian cancer-related mRNA expression levels 
(including over- or under-expression, in comparison to the expression level in a control sample). 
30 Such kits include an appropriate amount of one or more of the oligonucleotide primers for use in, for 
instance, reverse transcription PCR reactions, similarly to those provided above, with art-obvious 
modifications for use with RNA. 

In some embodiments, kits for detection of ovarian cancer-related mRNA expression may 
also include reagents necessary to carry out RT-PCR or other in vitro amplification reactions, 
35 including, for instance, RNA sample preparation reagents (including e.g., an RNAse inhibitor), 
appropriate buffers (e.g. t polymerase buffer), salts (e.g. y magnesium chloride), and 
deoxyribonucleotides (dNTPs). Written instructions may also be included. 

Kits may in addition include either labeled or unlabeled oligonucleotide probes for use in 
detection of an in vitro amplified target sequence. The appropriate sequences for such a probe will be 
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any sequence that falls between the annealing sites of the two provided oligonucleotide primers, such 
that the sequence the probe is complementary to is amplified during the PCR reaction. 

It may also be advantageous to provided in the kit one or more control sequences for use in 
the in vitro amplification reactions. The design of appropriate positive control sequences is well 
5 known to one of ordinary skill in the appropriate art. 

Alternatively, kits may be provided with the necessary reagents to carry out quantitative or 
semi-quantitative Northern analysis of ovarian cancer-related mRNA. Such kits include, for instance, 
at least one ovarian cancer-related sequence-specific oligonucleotide for use as a probe. This 
oligonucleotide may be labeled in any conventional way, including with a selected radioactive 
10 isotope, enzyme substrate, co-factor, ligand, chemnimiinescent or fluorescent agent, hapten, or 
enzyme. 

C. Kits For Detection of Ovarian Cancer-linked Protein or 
Peptide Expression 

1 5 Kits for the detection of ovarian cancer-linked protein expression, for instance altered (over 

or under) expression of a protein encoded for by a nucleic acid molecule listed in Table 1 or 
elsewhere, are also encompassed herein. Such kits may include for example at least one target 
(ovarian cancer-linked) protein (e.g., all or a portion of the small cell lung carcinoma cluster 4 antigen 
(CD24) (SEQ ID NO: 181-182), secretory leukocyte protease inhibitor antOeukoproteinase (SLPJ) 
20 (SEQ ID NO: 340-341), secreted phosphoprotein 1 (SPP1) (SEQ ID NO: 342), B-factor, properdin 
(BF) (SEQ ID NO: 343-344), "homolog of Cksl=p34Cdc28/Cdc2-associated protein" (CKS1) (SEQ 
ID NO: 345-347), matrix metalloproteinase 7 (MMP7) (SEQ ID NO: 348-349), paired box gene 8 
(PAX8) (SEQ ID NO: 350-351), serine protease inhibitor, Kunitz type, 2 (SPINT2) (SEQ ID NO: 
352-353), ZW10 interactor (ZWINT) (SEQ ID NO: 354), diacylglycerol kinase (DGKH) (SEQ ID 
25 NO: 355), high-mobility group (nonhistone chromosomal) protein isbforms I and Y (HMGIY) (SEQ 
ID NO: 356), Syndecan-4 - amphiglycan - ryudocan core protein (SDC4) (SEQ ID NO: 357-359), 
cyclin-dependent kinase inhibitor 2A (CDKN2A) (SEQ ID NO: 360), sodium channel, nonvoltage- 
gated 1 alpha (SCNN1A) (SEQ ID NO: 361-362), lactate dehydrogenase A (LDHA) (SEQ ID NO: 
363), adult folate receptor (FOLR1) (SEQ ID NO: 364-365), Triosephosphate isomerase 1 (TPI1) 
30 (SEQ ID NO: 366-367), kallikrein 8 (neuropsin/ovasin) (KLK8) (SEQ ID NO: 368), CXC chemokine 
receptor 4- fusin-neuropeptide Y receptor-L3 (CXCR4) (SEQ ID NO: 200), kinesin-like 1 (KNSLJ) 
(SEQ ID NO: 369-370), H2A histone family, member O (H2AFO) (SEQ ED NO: 371-372), major 
histocompatibility complex, class II, DR alpha, HLA-DRA, cysteine-rich protein 1 (intestinal) 
(CRIPJ) (SEQ ED NO: 375), pyrophosphatase (inorganic), (PP) (SEQ ID NO: 376), EST (SEQ ID 
35 NO: 377-378) 666391, glucose transporter (HepG2), (SLC2A1) (SEQ ID NO: 379-381), EST (SEQ 
ED NO: 377-378) 897770, hepatoma-derived growth factor (HDGF) (SEQ ID NO: 383-385), 
argininosuccinate synthetase (ASS) (SEQ ID NO: 386), claudin 4 (CLDN4) (SEQ ID NO: 387-388), 
preferentially expressed antigen in melanoma (PRAME) (SEQ ID NO: 389), LAR = LCA-homologue 
(PTPRF) (SEQ ED NO: 390-391), eyes absent (Drosophila) homolog 2 (EYA2) (SEQ ID NO: 392- 
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10 



393), L-myc (MYCLJ) (SEQ ID NO: 394-396), STAT1=IFN alpha/beta-responsive transcription 
factor ISGF3 beta subunits (p91/p84) (STAT1) (SEQ ID NO: 397-399), mitochondrial carrier 
homolog 2 (MTCH2) (SEQ ID NO: 400-401), 5-hydxoxytryptamine (serotonin) receptor 3A (HTR3A) 
(SEQ ID NO: 402), cyclin El (CCNE1) (SEQ ID NO: 403-404), cadherin 6, type 2, K-cadherin (fetal 
kidney) (CDHS) (SEQ ID NO: 405), S^-AMP-activated protein kinase - gamma-1 subunit (PRKAGJ) 
(SEQ ID NO: 406-408), defensin beta 1 (DEFB1) (SEQ ID NO: 409), actin related protein 2/3 
complex, subnnit 1 A (41 kD) {ARPC1B) (SEQ ID NO: 410-411), PKC iota=Protein kinase C, iota 
(PRKO) (SEQ ID NO: 412-414), glyceraIdehyde-3-phosphate dehydrogenase (GAPD) (SEQ ID NO: 
415), complement component2 (C2) (SEQ ID NO: 416-417), H2A histone family, member Y 
(H2AFY) (SEQ ID NO: 418-419), transmembrane 4 superfamily member 1 {TM4SF1) (SEQ ID NO: 
420-421), glyceraldehyde-3-phosphate dehydrogenase (GAPD) (SEQ ID NO: 422-423), Interferon- 
inducible protein 1-8U (IFITM3) (SEQ ID NO: 424-426), glycine dehydrogenase (decarboxylase; 
glycine decarboxylase, glycine cleavage system protein P) (GLDQ (SEQ ED NO: 427-428), 
calumenin (CALU) (SEQ ID NO: 429-430), hemoglobin alpha 2 (HBA2) (SEQ ID NO: 43 1-432), 
15 S 1 00 calcium-binding protein Al 1 (calgizzarin) {S100AU) (SEQ ID NO: 433), Lactate 

dehydrogenase A (LDHA) (SEQ ID NO: 434-436), ubiquitin-conjugating enzyme E2C (UBE2C) 
(SEQ ID NO: 437), E2F-3=pRB-binding transcription factor=KIAA0075 {E2F3) (SEQ ID NO: 438- 
440), E-cadherin (CDH1) (SEQ ID NO: 441-442), proteasome (prosome, macropain) activator 
subunit 2 (PA28 beta) (PSME2) (SEQ ID NO: 443444), OP-l=osteogenic protein in the TGF-beta 
family (BMP7) (SEQ ID NO: 445-447), or topoisomerase H (TOP2A) (SEQ ID NO: 448) specific 
binding agent (e.g., a polyclonal or monoclonal antibody or antibody fragment), and may include at 
least one control. The ovarian cancer-linked protein specific binding agent and control may be 
contained in separate containers. The kits may also include a means for detecting ovarian cancer- 
related protein:agent complexes, for instance the agent may be detectably labeled. If the detectable 
25 agent is not labeled, it may be detected by second antibodies or protein A, for example, either of both 
of which also may be provided in some kits in one or more separate containers. Such techniques are 
well known. 

Additional components in some kits include instructions for carrying out the assay. 
Instructions will allow the tester to determine whether ovarian cancer-linked expression levels are 
30 elevated or reduced in comparison to a control sample. Reaction vessels and auxiliary reagents such as 
chromogens, buffers, enzymes, etc. also maybe included in the kits. 
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EXAMPLE 12 
Identification of Therapeutic Compounds 

This example describes how to use the ovarian cancer-related molecules disclosed herein to 
5 identify compounds for potential therapeutic use in treating, reducing, or preventing ovarian cancer or 
development or progression of ovarian cancer. 

The ovarian cancer-related molecules disclosed herein, and more particularly the linkage of 
these molecules to cancer, can be used to identify compounds that are useful in treating, reducing, or 
10 preventing ovarian cancer or development or progression of ovarian cancer. These molecules can be 
used alone or in combination, for instance in sets of two or more that are linked to cancer or cancer 
progression. 

By way of example, a test compound is applied to a cell, for instance a test cell, and at least 
one ovarian cancer-related molecule level and/or activity in the cell is measured and compared to the 
15 equivalent measurement from a test cell (or from the same cell prior to application of the test 
compound). If application of the compound alters the level and/or activity of an ovarian cancer- 
related molecule (for instance by increasing or decreasing that level), then that compound is selected 
as a likely candidate for further characterization. In particular examples, a test agent that opposes or 
inhibits an ovarian cancer-related change is selected for further study, for example by exposing the 

20 agent to an ovarian epithelial cancer cell in vitro, to determine whether in vitro growth is inhibited. 
Such identified compounds may be useful in treating, reducing, or preventing ovarian cancer or 
development or progression of ovarian cancer. In particular embodiments, the compound isolated 
will inhibit or inactivate an ovarian cancer-related molecule, for instance those represented by the 
nucleic acids listed in Table 1. 

25 Methods for identifying such compounds optionally can include the generation of ah ovarian 

cancer-related gene expression profile, as described herein. Control gene expression profiles useful 
for comparison in such methods may be constructed from normal ovarian tissue, including primary 
ovarian cancer tissue. 

30 EXAMPLE 12 

Gene Expression Profiles (Fingerprints) 

This example describes how to use the ovarian cancer-related nucleic acids and analysis 
methods disclosed herein to generate and use gene expression profiles, or "fingerprints." 

35 

With the provision herein of methods for determining molecules that are linked to ovarian 
cancer, and the provision of a large collection of such ovarian cancer-linked molecules (as 
represented for instance by those listed in Table 1), gene expression profiles that provide information 
on the ovarian cancer-state of a subject are now enabled. 
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Ovarian cancer-related expression profiles comprise the distinct and identifiable pattern of 
expression (or level) of sets of ovarian cancer-related genes, for instance a pattern of high and low 
expression of a defined set of genes, or molecules that can be correlated to such genes, such as 
mRNA levels or protein levels or activities. Useful sets of molecules for constructing nucleic acid 
expression profiles include at least one that is represented by the following genes and ESTs: 
BCKDHB (SEQ ID NO: 16-17), ZNF33A (SEQ ID NO: 20-22), EST 192198 (SEQ ID NO: 25), EST 
128738 (SEQ ID NO: 26-27), EST 42921 1 (28-29), FU22174 (SEQ ID NO: 30-31), EST 415562 
(SEQ ID NO: 32-33), EST 296488 (SEQ ID NO: 34-35), EST 120124 (SEQ ID NO: 36-37), EST 
132142 (SEQ ID NO: 38-39), EST 50635 (SEQ ID NO: 40), POR (SEQ ID NO: 4143), EST 73702 
(SEQ ID NO: 46-47), EST 2218314 (SEQ ID NO: 48), EST 2261 1 13 (SEQ ID NO: 49)! IFITM1 
(SEQ ID NO: 50-54), IFITM2 (SEQ ID NO: 55-59), KIAA0203 (SEQ ID NO: 61-62), G1P3 (SEQ ID 
NO: 68-69), BST2 (SEQ ID NO: 70-72), EST 1384797 (SEQ ID NO: 196) , TLR3 (SEQ ID NO: 199- 
201), SPON1 (SEQ ID NO: 160-161), HSRNASEB (SEQ ID NO: 162-163), EST 294506 (SEQ ID 
NO: 146-148), SORL1 (SEQ ID NO: 149-151), SIAT1 (SEQ ID NO: 73), PL1 (SEQ ID NO: 77), EST 
108422 (SEQ ID NO: 78-79), CEBPG (SEQ ID NO: 80), HLA-DPA (SEQ ID NO: 97-99), H2AFL 
(SEQ ID NO: 107-109), IGKC (SEQ ID NO: 112-116), SCYB10 (SEQ ED NO: 120-121), RGS1 (SEQ 
ID NO: 122-126), LSR68 (SEQ ID NO: 168), SGK (SEQ ID NO: 176-178), and ZFP36 (SEQ ID NO: 
167-173). These genes and ESTs, which have not previously been correlated with cancer, present 
potentially useful novel markers for cancer, and in particular, ovarian cancer. 

A second example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by the SEQ ID NOs in Table 
9. These nucleic acids, which are disclosed herein to be differentially expressed in ovarian cancer 
(see Figure 2), are suitable for markers to diagnose, prognose, and monitor ovarian cancer in a 
subject. In addition, these genes and ESTs are potentially useful as markers for classifying tumors 
into types, for instance into BRCAJ-type or BRCA2-type rumors, using the meraods-discloseff herein. 

A third example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ED NOs: 417, 284, 
285, 281, 283, 278, 273, 282, 274. These represent markers disclosed herein that were found to be 
differentially expressed between BRCA J-Lwked and sporadic tumors in a comparison to reference 
Immortalized Ovarian Epithelial Cells (IOSE). These markers are useful for classifying tumors into 
BRCAl-]mke& and sporadic types, and present potential targets for treatment of ovarian cancer. 

A fourth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 279-280, 
which, as disclosed herein, are markers that were found to be differentially expressed between 
35 £^G42-Linked and sporadic tumors in a comparison to reference Immortalized Ovarian Epithelial 
Cells. These markers are useful for classifying tumors into BRCA2-\iake& and sporadic types, and 
present potential targets for treatment of ovarian cancer. 

A fifth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ED NOs: 281, 282 ' 



20 



25 



30 



WO 03/068054 



PCT/US03/04688 



67 

and 274, which, as disclosed herein, are markers that were found to be differentially expressed 
between combined BRCA-Lwked and sporadic tumors in a comparison to reference Immortalized 
Ovarian Epithelial Cells. These markers are userul for classifying tumors into BRCA-linked and 
sporadic types, and present potential targets for treatment of ovarian cancer. 
5 A sixth example set of molecules that could be used in a profile would include at least one 

that is represented by (or correlated to) the genes and ESTs represented by the SEQ ID NOs set forth 
in Table 10, which, as disclosed herein, are markers that can be used to segregate BRCAl-linked from 
BRCA2Amked tumor types using compound covariate prediction analysis. These markers are useful 
for classifying tumors into one of two types of tumors, which provides information helpful to a 
10 clinician in choosing a course of treatment for the patient based on the type of tumor into which the 
sample is classified. 

A seventh example set of molecules that could be used in a profile would include at least 
one that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NO: 16-201, 
565-567, and 803-804. These genes and ESTs were found, as disclosed herein, to be differentially 
15 expressed in a comparison of BRCA 7-linked and BRCA2-lmked to sporadic tumors. Hence, these 
genes and ESTs present potentially useful markers for classifying tumors into types, using the 
methods disclosed herein. Furthermore, they represent potential targets for pharmaceutical treatment 
of tumors of each respective tumor type. 

A eighth example set of molecules that could be used in a profile would include at least one 
20 that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 1 24-126, 
319, 429-430, 504-523, 533-535, 544, and 548-799. As disclosed herein, these nucleic acids were 
found to be overexpressed in a comparison of BRCA 7-linked, BRCA2Amktd and sporadic tumor 
samples. Hence, these genes and ESTs present potentially userul markers for classifying rumors into 
types, using the methods disclosed herein. Furthermore, they represent potential targets for 
25 pharmaceutical treatment of tumors of each respective tumor type. ' *" 

A ninth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs: 202-339. 
As disclosed herein, these nucleic acids were found to be overexpressed in ovarian cancer in a 
comparison of ovarian epithelial cancer to normal postmenopausal ovarian tissue. Hence, these genes 
30 and ESTs present potentially useful markers diagnosis, prognosis, and monitoring of ovarian cancer. 
In addition, they represent potential targets for pharmaceutical treatment of ovarian tumors. 

A tenth example set of molecules that could be used in a profile would include at least one 
that is represented by (or correlated to) the genes and ESTs represented by SEQ ID NOs:97 and 340- 
448. As disclosed herein, these nucleic acids were found to be underexpressed in ovarian cancer in a 
35 comparison of ovarian epithelial cancer to normal postmenopausal ovarian tissue. Hence, these genes 
and ESTs present potentially userul markers diagnosis, prognosis, and monitoring of ovarian cancer. 
In addition, they represent potential targets for pharmaceutical treatment of ovarian tumors. 

In other examples of ovarian cancer-related gene expression profiles, such profiles may be 
further broken down by the manner of molecules included in the profile. Thus, certain examples of 
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profiles may include a specific class of ovarian cancer markers, such as those molecules involved in 
cell cycle control. 

Particular profiles may be specific for a particular stage of normal tissue (e.g., ovarian 
tissue) growth or disease progression (e.g., progression of ovarian cancer). Thus, gene expression 
5 profiles can be established for a pre-ovarian cancer tissue normal ovarian tissue), and a primary 
ovarian cancer tissue. Each of these profiles includes information on the expression level of at least 
one, but usually two or more, genes that are linked to ovarian cancer (e.g., ovarian cancer-related 
genes). Such information can include relative as well as absolute expression levels of specific genes. 
Likewise, the value measured may be the relative or absolute level of protein expression, which can 
10 be correlated with a "gene expression level." Results from the gene expression profiles of an 

individual subject are often viewed in the context of a test sample compared to a baseline or control 
sample fingerprint. 

The levels of molecules that make up a gene expression profile can be measured in any of 
various known ways, which may be specific for the type of molecule being measured. Thus, nucleic 

15 acid levels (such as direct gene expression levels, such as the level of mRNA expression) can be 
measured using specific nucleic acid hybridization reactions. Protein levels may be measured using 
standard protein assays, using immunologic-based assays (such as ELISAs and related techniques), or 
using activity assays, for instance. Examples for measuring nucleic acid and protein levels are 
provided herein; other methods are well known to those of ordinary skill in the art. 

20 Examples of ovarian cancer-related gene expression profiles can be in array format, such as 

a nucleotide (e.g., polynucleotide) or protein array or microarray. The use of arrays to determine the 
presence and/or level of a collection of biological macromolecules is now well known (see, for 
example, methods described in published PCT application number W09948916, describing hypoxia- 
related gene expression arrays). In array-based measurement methods, an array may be contacted 

25 with polynucleotides (in the case of a nucleic acid-based array) or polypeptides (in the case of a 
protein-based array) from a sample from a subject The amount and/or position of binding of the 
subject's polynucleotides or polypeptides then can be determined, for instance to produce a gene 
expression profile for that subject. Such gene expression profile can be compared to another gene 
expression profile, for instance a control gene expression profile from a subject having a known 
30 gynecological or ovary-related condition. Optionally, the subject's gene expression profile can be 
correlated with one or more appropriate treatments, which may be correlated with a control (or set of 
control) expression profiles for stages of ovarian cancer, for instance. 

This disclosure provides the identification of ovarian cancer-related molecules that exhibit 
35 alterations in expression during development of ovarian cancer, and expression fingerprints (profiles) 
specific for ovarian cancers. It further provides methods of using these identified nucleic acid 
molecules, and proteins encoded thereby, and expression fingerprints or profiles, for instance to 
predict and/or diagnose ovarian cancer, and to elect treatments for instance based on likely response. 
These identified ovarian cancer-related molecules also can serve as therapeutic targets, and can be 
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used in methods for identifying, developing and testing therapeutic compounds. It will be apparent 
that the precise details of the methods described may be varied or modified without departing from 
the spirit of the described invention. We claim all such modifications and variations that fall within 
the scope and spirit of the claims below. 
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CLAIMS 

We claim: 



10 



1 . A method of classifying an ovarian tumor as a BRCA-l-like or BRCA-2-like or 
non-BRCA-like tumor, comprising: 

detennining a pattern of expression in the ovarian tumor of a plurality of markers listed in 
Table 1, wherein the pattern of expression in the ovarian tumor is determined relative to a standard 
ovarian tissue; 

comparing a similarity of the pattern of expression of the plurality of markers in the ovarian 
tumor to a pattern of expression of the plurality of markers in a comparison tissue of a known BRCA- 
l-like or BRCA-2-iike or non-BRCA-like tumor, wherein the pattern of expression in the comparison 
tissue is determined relative to the standard ovarian tissue; 

wherein a similarity of the pattern of expression in the ovarian tumor to a pattern of 
1 5 expression of the comparison tissue of the known BRCA-l-like tumor classifies the ovarian tumor as 
a BRCA-l-like tumor, a similarity of the pattern of expression in the ovarian tumor to a pattern of 
expression of the known BRCA-2-like tumor classifies the ovarian tumor as a BRCA-2-like tumor, 
and a similarity of the pattern of expression in the ovarian tumor to a pattern of expression of the 
known non-BRCA-like tumor classifies the ovarian tumor as a non-BRCA-like tumor. 

20 

2. The method of claim 1, wherein the method comprises detennining a pattern of 
over-expression or under-expression of the plurality of markers in the ovarian tumor to over- 
expression or under-expression of the plurality of markers of the comparison tissue. 

25 3 • ' The method of claim 2, wherein the method comprises detennining a pattern of 

both over-expression and under-expression of the plurality of markers in the ovarian tumor to over- 
expression or under-expression of the plurality of markers in the comparison tissue. 

4. The method of claim 1 wherein the comparison tissue is from a known BRCA-1 - 
30 like tumor, and the method comprises determining whether the ovarian tumor is a BRCA-l-like 

tumor by comparing the pattern of expression in the ovarian tumor to the pattern of expression in the 
comparison tissue. 



5. The method of claim 1 wherein the comparison tissue is from a subject known to 
35 have a mutation in BRCA-1 and the method comprises determining whether the ovarian tumor is a 
BRCA-l-like tumor by comparing the partem of expression in the ovarian tumor to the pattern of 
expression in the comparison tissue. 
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6. The method of claim 1 wherein the comparison tissue is from a subject known to 
have a mutation in BRCA-2 and the method comprises determining whether the ovarian tumor is a 
BRCA-2-like tumor by comparing the pattern of expression in the ovarian tumor to the pattern of 
expression in the comparison tissue. 

5 

7. The method of claim 1 wherein the comparison tissue is from a known BRCA-2- 
like tumor, and the method comprises determining whether the ovarian tumor is BRCA-2-like by 
comparing the pattern of expression in the ovarian tumor to the partem of expression in the 
comparison tissue. 

10 

8. The method of claim 7, wherein classifying the ovarian tumor comprises 
determining whether a tumor that does not contain a BRCA-1 or BRCA-2 mutation is BRCA-l-like 
orBRCA-2-Jike. 

15 9. The method of claim 1 wherein the comparison tissue is from a known non-BRCA- 

like tumor, and the method comprises determining whether the ovarian tumor is non-BRCA-like by 
comparing the pattern of expression in the ovarian tumor to the pattern of expression in the 
comparison tissue. 

20 10. The method of claim 1, wherein the standard ovarian tissue is tissue from an 

immortalized ovarian cell, ovarian tissue from a subject not having ovarian cancer, a subject not 
predisposed to developing ovarian cancer, or ovarian tissue from a subject from whom the ovarian 
tumor was obtained at an earlier point in time. 

25- • • 11 . The method of claim 1, wherein the patterns of expression are patterns of 

logarithmic expression ratios. 

12. The method of claim 1, wherein the patterns of expression are multidimensional 
scaling patterns. 

30 

13. The method of claim 12 wherein the multi-dimensional scaling patterns are visually 
compared to determine similarities. 

14. The method of claim 1, wherein the patterns of expression are hierarchical 
35 clustering patterns. 

15. The method of claim 14, wherein standard normal deviation values of the 
logarithmic expression ratios are assigned relative color intensities that assist in the visual 
comparison. 
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16. The method of claim 15, wherein the hierarchical clustering patters are visually 
compared to determine similarities, 

5 11 - 7116 method of claim 1 1 comprising comparing the logarithmic expression ratios of 

the plurality markers using compound covariate predictor analysis. 

1 8. The method of claim 1 1 , wherein the method comprises differentiating a BRCA 1 - 
like ovarian tumor from a sporadic ovarian tumor by comparing relative logarithmic expression ratios 

10 of at least one marker shown in Table 6. 

19. The method of claim 18, wherein differentiating a BRCAl-Uxk&d ovarian tumor 
from a sporadic ovarian tumor comprises comparing the relative logarithmic expression ratios of 
CD72 (SEQ ID NO: 805), SLC25A11 (SEQ ID NO: 544), LCN2 (SEQ ID NO: 545-547), PSTPIP1 

15 (SEQ ID NO: 538-540), SIAHBP1 (SEQ ID NO: 543), UBE1 (SEQ ID NO: 533), WAS (SEQ ED NO: 
524-526), IDH2 (SEQ ID NO: 541-542), oxPCTKl (SEQ ID NO: 527-528) in the ovarian tumor and 
comparison tissue. 

20. The method of claim 1 1, wherein the method comprises differentiating a BRCA2- 
20 like ovarian tumor from a non-BRCA-like ovarian tumor by comparing relative logarithmic 

expression ratios of at least one marker shown in Table 7. 

2 1 . The method of claim 20, wherein the method comprises comparing the relative 
logarithmic expression ratios of LOC51 760 (SEQ ID NO: 279) or LRPAP1 (SEQ ID NO: 280) to 

25 differentiate a BRCA2-like ovarian tumor froma non-BRCA like ovarian tumor 

22. The method of claim 21, wherein the method comprises differentiating a non- 
BRCA-like tumor from a BRCA-l-like or BRCA-2-like ovarian tumor by comparing relative 
logarithmic expression ratios of at least one marker shown in Table 8. 



30 



35 



23. The method of claim 22, wherein the method comprises comparing relative 
logarithmic expression ratios of PSTPIPJ (SEQ ID NO: 281), IDH2 (SEQ ID NO: 282), or PCTK1 
(SEQ ID NO: 274) to differentiate a combined BRCA J- and BRCA2-]inked ovarian tumor from a 
sporadic ovarian tumor. 

24. The method of claim 1 1 , wherein the method comprises differentiating a BRCA J- 
like ovarian tumor from a BRCA2-hke ovarian tumor by comparing relative logarithmic expression 
ratios of at least one marker shown in Table 10. 
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25. The method of claim 1, wherein the method further comprises selecting a treatment 
strategy based on classifying the ovarian tumor as BRCAl-tike, BRCA2- like or non-5/? CA -like. 

26. The method of claim 25, wherein the treatment strategy comprises selecting a more 
5 aggressive treatment regimen for a BRCAJAlke or BRCA2A&& tumor. 

27. The method of claim 26, wherein the treatment is chemotherapy, radiotherapy, or 
surgical removal of the affected tissue and/or surrounding area. 

i 

10 28. The method of claim 25, further comprising treating the subject with the selected 

treatment. 

29. The method of claim 11, wherein comparing the patterns of logarithmic expression 
ratios comprises comparing the logarithmic expression ratios to patterns of logarithmic expression 

15 ratios in a database of patterns associated with BRCAl-litoi, BRCA2-\ike or non-BRCA-like ovarian 
tumors. 

30. The method of claim 1 1 , wherein comparing patterns of logarithmic expression 
ratios of the plurality of markers comprises obtaining the partem of expression of the plurality of 

20 markers on an array. 

3 1 . The method of claim 1 , wherein the pattern of expression of the plurality of 
markers comprises over-expression of one or more markers compared to the standard. 

25 32. The method of claim 29, wherein the one or more markers that is overexpressed is 

listed in Table 5. 

33. The method of claim 32, wherein detenruning the pattern of expression comprises 
providing nucleic acid sequences of the markers, and performing nucleic acid hybridization of 

30 specific oligonucleotide probes to the nucleic acid sequences. 

34. The method of claim 33, wherein the sequence of the oligonucleotide probe is 
selected to bind specifically to a nucleic acid molecule listed in Table 1, 

35 35. The method of claim 34, further comprising amplifying the one or more markers 

prior to performing nucleic acid hybridization. 

3 6. The method of claim 33, further comprising quantitating hybridization to detect a 
level of differential expression. 
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37. The method of claim 33, wherein providing sequences of the markers comprises 
providing the nucleic acid sequences on an array carrying the plurality of markers. 

5 38. The method of claim 37, wherein the array is a cDNA microarray. 

39. The method of claim 33, wherein providing the nucleic acid sequences of the 
markers comprises providing at least 50 of the markers listed in Table 1. 

10 40. The method of claim 33, wherein providing the nucleic acid sequences of the 

markers comprises providing at least 100 of the markers listed in Table I. 

41. The method of claim 33, wherein providing the nucleic acid sequences of the 
markers comprises providing at least 200 of the markers listed in Table t. 

15 

42. A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject comprising detecting under-expression of one or more markers in Tabic 4 relative 
to a standard. 

20 43. The method of claim 42, wherein the standard is immortalized ovarian epithelial 

cells, ovarian tissue from a subject not having cancer or a subject not predisposed to developing 
cancer, or ovarian tissue from the subject at an earlier point in time. 

44. The method of claim 42 wherein die one or more markers comprise a nucleic acid 
25 . encoded by SEQ ID NOs: 449-503. • 

45 . A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject comprising detecting over-expression of one or more markers in Table 5 relative 
to a standard. 

30 

46. The method of claim 45, wherein the standard is immortalized ovarian epithelial 
cells, ovarian tissue from a subject not having cancer or a subject not predisposed to developing 

. cancer, or ovarian tissue from the subject at an earlier point in time. 

35 47. The method of claim 45 wherein the one or more markers comprise a nucleic acid 

encoded by SEQ ID NOs: 18-19, 30-31, 50-51, 52-54, 55-57, 58-59, 60, 68-69, 74-76, 85-86, 87-88, 
89-91, 92-93, 94-95, 97-99, 122-123, 133-135, 149-151, 164-166, 167-168, 169-170, 174-175, 176- 
178, 179-180, 181-182, 190-192, or 199-201. 
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48. A method of screening for an agent for treating or inhibiting ovarian cancer in a 
subject, comprising exposing a tumor cell to a therapeutically effective amount of a pharmaceutical 
compound that restores wild-type expression of at least one BRCAJ-like or BRCA2A&& marker listed 
in Table 1. 

5 

49. The method of claim 48 wherein the agent corrects under-expression or over- 
expression of a marker listed in Table 1. 

50. A method of monitoring a response to therapy for an ovarian tumor, comprising 
1 0 monitoring expression of the markers in the subject following adrninistration of the therapy. 

51. A method of diagnosing or prognosing development or progression of ovarian 
cancer in a subject comprising detecting differential expression of a gene that maps to Chromosome 
Xpll.2. 

15 

52. A kit for classifying one or more ovarian tumors as sporadic, BRCAJ-like or 
BRCA2-]ike tumors, comprising components for measuring expression levels of markers in the one or 
more ovarian tumor samples and for comparing the expression levels of the markers to the markers in 
Table 10. 

20 

53. The kit of claim 52, wherein the expression levels of a plurality of markers from 
each tumor are measured. 

54. The kit of claim 52, comprising an array carrying a plurality of markers. 

25 . . - •■ ' 

55 . The kit of claim 52, comprising a binding molecule that selectively binds to a 
marker in the one or more tumor samples, and wherein the marker is listed in Table 10. 

56. The kit of claim 52, wherein the expression levels measured are of a non-BRCA- 
30 like, BRCAl-like or BRCA2-tike tumor protein, and the binding molecule is an antibody or antibody 

fragment that selectively binds the tumor protein. 



35 



57. The kit of claim 52, wherein the expression levels measured are of a BRCA-like, 
BRCAJ-tike or BRCA2-like nucleic acid marker, and the binding molecule is an oligonucleotide 
capable of hybridizing to the nucleic acid molecule marker. 
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Figure 2A' 
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Figure 2C 
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Figure 3A 



CM 



CO 



o 

CD 




Z -2.5 -1.5 -0.5 0 0.5 1.5 2.5 



WO 03/068054 



7/60 



PCT/US03/04688 



Figitre 3 A' 




Z -2.5 -1.5 -0.5 0 0.5 1.5 2.5 



WO 03/068054 PCT/US03/04688 

8/60 



Figure 3B 




IIWWl— ■ I I'll I 
Z -2.5 -1.5 -0.5 0 0.5 1.5 2.5 



mmm 



WO 03/068054 



10/60 



PCT/US03/04688 



Figure 3C 

DESIGNATIONS IN FIGURE 3-A 
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SEO ID NO: 


1 
1 


B2-1 


PAK2 


557-558 


2 


B2-10 


NCSTN 


678 


3 


B2-16 


HGF 


585-586 


4 


B2-2 


BAD 


587-588 


5 


B2-20 


F23149 


766-767 


6 


B2-21 


DKFZP564C186 . 


763-764 


7 


B2-22 


UBL1 


510-512 


g 


B2-23 


GCAT 


741-742 


9 


B2-24 


RBBP4 


691-692 


10 


B2-25 


CALU 


375 


11 


B2-3 


RUNX1 


518-520 


12 


B2-4 


PTK2B 


624-625 


13 


B2-6 


FDFT1 


651-652 


14 


B2-7 


IL18R1 


618-620 


15 


B2-8 


P14L 


743-744 


16 


B2-9 


RALY 


712-713 


17 


B36 


KIAA0218 


703-704 


18 


B39 


MPI 


666-667 


19 


B40 


IL17R 


- 583-584 


20 


B41 


KIAA0008 


745-746 


21 


B52 


IL1B 


775-776 


22 


B54 


RAB3A 


739-740 


23 


B55 


HARS 


662-663 


24 


B60 


TUFM 


659 


25 


B61 


PEF 


719-720 


26 


B62 


GNB2 


674-675 


27 


B63 


SECRET 


655-656 


28 


B64 


SLC9A1 


657-658 


29 


B70 


NAGA 


653-654 


30 


B74 


MNAT1 


660-661 
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Figure 3C, continued 



DESIGNATIONS IN FIGURE 3-A 
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Figure 3D 
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Figure 3D 
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Figure 3D, continued 

DESIGNATIONS IN FIGURE 3-B 
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C16 . 
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CI 
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C20 


CUGBP1 
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C41 


PON1 
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C42 
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FU21661 
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Figure 4A-C 
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Figure 5A 



i 



T 



i 

4^ 



T 



I 



T-nlTlft 



III 



1 




*j d o o ^ 




WO 03/068054 



18/60 



PCT7US03/04688 



Figure 5B 
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Figure 6 A 
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Figure 6B 
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Table 1. Markers that were Differentially Expressed in a cDNA Microarray 
Expression Profile of Sixty-One Ovarian Cancer Tumors 



SEQ 
ID 
NO. 


Gene 


IMAGE 
ID No. 


UniGene 
No. 


GenBank 
Accession 
No. 


Gene Description 


16- 
17 


BCKDHB 


770835 




AA427739; 
AA434304 


Branched chain keto 
acid dehydrogenase 
El, beta polypeptide 
(maple syrup urine 
disease) 


18- 
19 


SERPINF 
2 


82195 




T68859; 
T68934 


serine (or cysteine) 
proteinase inhibitor, 
clade F (alpha-2 
antiplasmin, pigment 
epithelium derived 
factor), mem 


20- 
22 


ZNF33A 


346902 




D31763; 
W78164; 
W79207 


KIAA0065 


23- 
24 


ZNF33A 


246543 




N57658; 
N77515 


Zinc finger protein 33a 
(KOX31) 


25 


EST 


192198 




H41144 


Unknown 


26- 
27 


EST 


128738 




R16726; 
R09980 


Homo sapiens 
cDNA:FLJ23371 fis, 
clone HEP16068, 
highly similar to 
HSTFHSH Homo 
sapiens mRNA for trar 


28- 
29 


EST 


429211 




AA007283; 
AA007282 


ESTs 


30- 
31 


FLJ22174 


295939 




N67034; 
W04283 


hypothetical protein 
FLJ22174 


32- 
33 


EST 


415562 j 




W80701; 
W78802 


Unknown 


34- 
35 


EST 


296488 




N70208; 
W01059 


Unknown 


36- 
37 


EST 


120124 




T95064; 
T95160 


ESTs 


38- 
39 


EST 


132142 




R26164; 
R23610 


Homo sapiens 
cDNA:FLJ21587 fis, 
clone COL06946 


40 


EST 


50635 




H17921 


ESTs 


41- 
43 


POR 


234180 




S90469; 
H70626; 
H66247 


Cytochrome P450 
reductase 


44- 
45 


CLU 


725877 




AA292226; 
AA292410 


Clusterin (complement 
lysis inhibitor, SP- 
40,40, sulfated 
glycoprotein 2, 
testosterone-repressed 
prostate messenger 


46- 
47 


EST 


73702 




T54544; 
T54585 


Unknown 


48 


EST 


2218314 




AI744768 


Unknown 


49 


. EST 


2261113 




AI609063 


EST 



1 
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50- 
51 


IFITM1 


j 755599 




AA419251; 
AA419286 


Interferon induced 
transmembrane protein 
1 (9-27) 


52- 
54 


IFITM1 


509641 




J04164; 
AA058323; 
AA058453 


Interferon-inducible 
protein 9- 

27=interferon-induced 
17kDa membrane 
protein 


55- 
57 


EFITM2 


624655 




X57351; 
AA187365; 
AA1 87099 


Interferon-induced 
protein 1-8D 


58- 
59 


IFITM2 


376520 




AA041402; 
AA041501 


Interferon-induced 
protein 1-80 


60 


HE4 


786675 




AA451904 


Epididymis-specific, 
whey-acidic protein 
type, four-disulfide 
core; putative ovarian 
carcinoma marker 


61- 
62 


KIAA020 
3 


61008 




T40715; 
T39659 


KIAA0203 gene 
product 


63- 
64 


IL8RB 


882183 




M73969; 
AA480683 


CXCR2-IL-8 
Receptor beta 


65- 
67 


VDUP1 


297954 




S73591; 
N68956; 
W00656 


Brain-expressed 
HHCPA78 
homolog=Induced in 
HL60 cells treated 
with vitamin D or 
cycloheximide 


68- 
69 


G1P3 


782513 




AA448478; 
AA432030 


Interferon, alpha- 
inducible protein 
(clone IFI-6-16) 


70- 

! 72 


BST2 


811024 




D28137; 
AA485371; 
AA485528 


Bone marrow stromal 
cell antigen 2 


73 


SIAT1 


897906 




AA598652 


Sialyltransferase 1 
(beta-galactoside 
alpha-2,6- 
sialyltransferase) 


74- 
76 


DDR1 


182288 




U48705; 
H41900; 
H41939 


Receptor protein- 

tyrosineTdnase 

EDDR1 


77 


PL1 


433573 




AA701655 


Human endogenous 
retrovirus envelope 
region mRNA (PL1) 


78- 
79 


EST 


108422 




T77847; 
T77926 


Homo sapiens, clone 
MGC: 12275, mRNA, 
complete cds 


80 


CEBPG 


455121 




AA676804 


CCAAT/enhancer 
binding protein 
(C/EBP), gamma 


81- 
82 


MUC1 


840687 




AA488073; 
AA486365 


Mucin 1, 
transmembrane 


83- 
84 


CP 


223350 




H86554; 
H86642 


ceruloplasmin 
(ferroxidase) 


85- 
86 


HLA- 
DRB5 


811139 




AA485739; 
AA486460 


Major 

histocompatibility 
complex, class n, DR 
betaS 


87- 


HLA- 


417711 




W88967; | Major 



2 
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88 


DRB1 






W88546 


histocompatibility 
complex, class II, DR 
beta 1 


89- 
91 


CD74 


725751 




X00497; 
AA399225; 
AA292218 


Invariant chain=la- 
associated invariant 
gamma-chain 


92- 
93 


CD74 


840681 




AA488071; 
AA486363 


Invariant chain=la- 
associated invariant 
gamma-chain 


94- 
96 


HLA- 
DRA 


117411 




K01171; 
T89719; 
T89816 


MHC Class II=DR 
alpha 


97- 
99 


HLA- 
DPA 


207715 




X00457; 
H62294; 
H62293 


MHC Class n=DP 
alpha 


100 


HLA- 
DRB1 


855547 




AA664195 


Major 

histocompatibility 
complex, class II, DR 
beta 1 j 


101- 
103 


HLA- 
DRB1 


470953 




M20430; 
AA032179; 
AA033653 


MHC Class H=DR 
beta 


104- 
106 


TNFSF10 


203132 




U57059; 
H54629; 
H54628 


TRAIL=Apo-2 ligand 


107- 
109 


H2AFL 


429091 




U90551; 
AA007585; 
AA007574 


Histone-2A-like 
protein (H2A/I) 


310- 
111 


IG 
lambda 


66560 




T67053; 
T67054 


Immunoglobulin 
lambda locus 


112- 
114 


IGKC 


159142 




M63438; 
R76324; 
R76323 


Immunoglobulin 
kappa light chain 


115- 
116 


IGKC 


840451 




AA485725; 
AA485862 


Immunoglobulin 
kappa light chain 


117- 
119 


RAD23A 


293925 




AF004230; 
N63943; 
N98412 


MIR- 

7=monocyte/macropha 
ge lg-related receptor 
AND RAD23=UV 
excision repair protein 
(Double hit) 


120- 
121 


SCYB10 


967284 




X02530; 
AA527139 


IP-10 


122- 
123 


RGS1 


361323 




AA017544; 
AA017417 


regulator of G-protein 
signaling 1 


124- 
126 


RGS1 


686248 




S59049; 
AA262268; 
AA262879 


BL34=RGS 1 =regulato 
r of G-protein 
signaling which 
inhibits SDF-1 
directed B cell 
migration 


127- 
129 


GAS1 


365826 




L13698; 
AA025819; 
AA025884 


Growth arrest-specific 
1 


130- 
132 


BTG2 


358214 




U72649; 
W95415; 
W95512 


BTG2=p53 dependent 
inducible anti- 
proliferative gene 
homologous to 
Pc3/Tis21 immediate 



3 



24/60 













early genes 


133- 
135 


TAP 

FOS 


755279 




\ V01512; 
AA496353; 


C-IOS 


136 


LSR68 


1862044 




AI053597 


Lipopolysaccharide 
specinc icsponse-oo 
protein 


137- 
1 1R 

1 JO 


JUNB 


309864 




N94468; 

W91R47 

VV ZjOH / 


Jun B proto-oncogene 


139- 
140 


JUNB 


122428 




T99236; 
T99280 


Jun B proto-oncogene 


141- 
143 


C0L3A1 


122159 




X14420; 
| T98612; 
T n R61 1 


Collagen Type III 
Alpha 1 


144- 
145 


LUM 


813823 




AA447781; 
AA453712 


lumican 


1 

148 


T7QT 


zy^DUO 




uyuvio, 
N71007; 
W0 1902 

VV V/17U& 


clone zoo i j ttlkin a 


149- 
151 


SORJLl 


279388 




Y08110; 
N48698; 
N45548 


Mosaic protein 

LR1 l=hybrid receptor 


152- 
153 


KNASE6 
PL 


712341 




AA405000; 
AA281840 


Ribonuclease 6 
precursor 


1 

156 


TTT A r> 


7607^1 




IYIZoZUj, 

AA429012; 
AA429162 


xiunian i^euKocyis 
Antigen B 


1 *7 

159 


TTT A C 


oi niiio 
o lUi^Z 




1VJL1 1550, 

AA464246; 
AA464354 


Human jLeuKOcyie 
Antigen C 


161 


OJT WIN X 






H09099- 
H09449 


Snfinrfin 1 ff-^nnnrHn^ 
extracellular matrix 
protein 


163 


XloJxiNrVO 

EB 


R14S96 

O ItJZU 




/U\*tJ7JUJj 

AA459588 


RNA-TrinHint* rpuinn 

(RNP1, REM) 
contoining 1 


1AA 
104- 

166 




O IjZJO 




lVll*r / JO, 

AA455911; 
AA456377 


IViX/ J\-l 1V1 U 1 U.LLL 

resistance protein 1=P- 
glycoprotein 


167- 

100 


ZFP36 


23804 




R38383; 

T7740Q 


Zinc finger protein 

Yi Atnr*loorr»nc tn 7"fr\-.16 
JJAJIIlUlUgUUo IU tjip J \J 

in mouse 


169- 
171 


ZFP36 


135880 




M63625; 
R33813; 
R33R12 

I\JJ Olb 


TTP==tristetraproline= 
GOS24=zinc finger 

trfln^f!ri"ntinTia1 

regulator 


172- 

1 / J 


ZFP36 


727266 




AA411987; 
AA402178 


TTP==tristetraproline= 

transcriptional 
regulator 


174- 
175 


HGF 


41650 




R52798; 
R52797 


hepatocyte growth 
factor (hepapoietin A; 
scatter factor) 


176- 
178 


SGK 


840776 




AJ000512; 
AA486082; 
AA486140 


sgk=putative 
serme/tlueoriine 
protein kinase 
transcriptionally 
modified during 
anisotonic and isotonic 
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alteration 


179- 
180 


PPP1R7 


814508 




AA459351; 
AA459572 


Protein phosphatase 1, 
regulatory subunit 7 


181- 
182 


CD24 


204335 




H59916; 
H59915 


CD24 antigen (small 
cell lung carcinoma 
cluster 4 antigen) 


183- 
184 


TPD52 


814306 




AA459100; 
AA459318 


Tumor protein D52 


185- 
187 


CXCR4 


79629 




X71635; 
T62636; 
T62491 


CXC chemokine 
receptor 

4=fusin=neuropeptide 
Y receptor=L3 


188- 
189 


JUND 


767784 




AA418670; 
AA4 18676 


Jun D proto-oncogene 


190- 
192 


BRF2 


485770 




U07802; 
AA039882; 
AA039967 


Tis lld=ERF- 
2=growth factor early 
response gene 


193- 
195 


A2M 


377647 




M11313; 
AA055995; 
AA055907 


Alpha-2- 
macroglobulin 


196 


EST 


1384797 




AA856938 


Homo sapiens rnRNA; 
cDNA 

DKFZp434O0227 
(from clone 
DKFZp434O0227) 


197- 
198 


CD24 


196519 




S75311; 
R91610 


CD24 


199- 
201 


TLR3 


144675 




U88879; 
R76099; 
R76150 


TLR3= Toll-like 
receptor 3 


202 


ITM2A 


878596 


Hs.17109 


AA775257 


integral membrane 
protein 2A 


203- 
204 


GATM 


42558 


Hs.75335 


R61229; 
R61228 


glycine 

amidinotransferase (L- 

argininerglycine 

amidinotransferase) 


205- 
207 


RNASE4 


81417 


Hs.283749 


D37931; 
T60163; 
T60223 


ribonuclease L (2 ! ,5- 

oligoisoadenylate 

synthetase-dependent) 


208- 
210 


LAMA2 


471642 


Hs.75279 


• Z26653; 

AA034939; 

AA034938 


larriinin alpha 2 
(merosin, congenital 
muscular dystrophy) 


211 


PBX3 


448386 


Hs.294101 


AA778198 


pre-B-cell leukemia 
transcription factor 3 


212 


PLA2G6 


1472538 


Hs.120360 


AA872271 


phospholipase A2, 
group VI (cytosolic, 
calcium-independent) 


213 


SMARC 
A2 


814636 


Hs.198296 


AA481026 


SWI/SNF related, 
matrix associated, 
actin dependent 
regulator of chromatin, 
subfamily a, member 2 


214- 
215 


CUGBP2 


488956 


Hs.211610 


AA047257; 
AA057142 


CUG triplet repeat, 
RNA-binding protein 
2 


216- 
218 


TGFBR3 


209655 


Hs.79059 


L07594; 
H62473; 
H61499 


TGF beta receptor 
type III 


219 


STAR 


859858 


Hs.3132 


AA679454 


steroidogenic acute 
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regulatory protein 


220 


frNfil 1 

VJ i> KJ ± I 




"He R17R1 


A A 00OOA1 


guanine nucleotide 
binding protein 11 


221 


QTED2 


491565 


Hs.82071 


AA1 15076 


Cbp/p3 0 O-interacting 
trails activator, witn 
Glu/Asp-rich carboxy- 
terrriinal domain, 2 


222 


1 


744 A47 


jtIS.jS^oo 


A A £7 171^ 


catenin (cadherin- 
associated protein), 
alpha-like 1 


223 


ABCA8 


743773 


Hs.38095 


AA634308 


ATP-binding cassette, 
sub-family A (ABC1), 
member 8 


974 

226 




1 CQOI*? 
IooZjZ 


tt„ naiA 

Jtls, /yj4 


Ar 105036; 
H45668; 
H45711 


GKLF=EZF= s KLF4=g 
ut-enriched Kruppei- 
like zinc finger 
protein=expressed in 
vascular endothelial 
cells 


227- 
228 


ITPR1 


471725 


Hs.198443 


AA035450; 

A A 03 5477 


inositol 1,4,5- 
mpnospnate receptor, 
typel 


229- 
230 


MAF 


487793 


Hs 307^0 


A A043501- 
AA044658 

AAwTTuJ O 


v-maf 

1 iiUoU U1U OU UliC UIU LIU 

fibrosarcoma (avian) 
oncogene hornolog 


231- 
232 


FOXC1 


768370 


Hs 284186 


AA495790* 
AA495846 


ftvrkhpflH bnv PI 


233 


TCF21 


461351 


Hs 78061 


AA6QQ7K7 


fTOTtOf rn^ti/^ti -fVi/>i"r\f 7 1 

ucuiouopuun jLdcior z i 


234- 
236 


CCNI 


248295 


Hs.79933 


D50310; 
N^R^1 1- 

nJOJ 1 1, 

N78101 


CyclinI 


237- 
238 


DCN 


209367 


Hs.76152 


H64138; 


decorin 


239- 
240 


CBF2 


789049 


Hs.184760 


AA452909; 
AA453077 


CCAAT-box-binding 
transcription factor 


241- 
242 


FST 


68049 




I JZojU, 

T52829 


nomo sapiens, cione 
IMAGE:4183312, 
rnRNA, partial cds 


■ 122 


RNASE4 


81417 


Hs.283749 




ribonuclease L (2',5'- 

vju^uiauauciiyiaiC/ 

synthetase-dependent) 


243 


SLC4A 1 
AP 




Uo ^06000 




suiuic Laxncr lamiiy t 
(anion exchanger), 
member 1, adapter 


244- 
245 


GSTM5 


377731 


Hs.75652 


AA056232; 
AA056231 


glutathione S- 
transferase M5 


246 


C4BPB 


460470 


Hs 99886 


AA677687 


conrol ement 
component 4-binding 
protein, beta 


247- 
248 


HS3ST1 


73609 


Hs.40968 


T55714; 
T55756 


heparan sulfate 
(glucosamine) 3-0- 
sulfotransferase 1 


249 


CDKN1C 


2413955 


Hs.106070 


AI828088 


cyclin-dependent 
kinase irihibitor 1C 
( P 57,Kip2) 


250 


HNRPDL 


897823 


Hs.170311 


AA598578 


leterogeneous nuclear 
ribonucleoprotein D- 
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like 


251 


CERBP 


1558799 


Hs. 11 9475 


AA977242 


cold inducible RNA- 
binding protein 


252 


RGS2 


2321596 


Hs.78944 


AI675670 


regulator of G-protein 
signalling z, z**kd 


253- 
254 


TCEAL1 


786607 


Hs.95243 


AA478480; 
AA451969 


transcription 
elongation factor A 
(Mlj-uke 1 


255- 
256 


CAV1 


377461 


Hs.323469 


AA055835; 
AA055368 


caveolin 1, caveolae 
protein, 22kD 


Z J / 


AT TYH"1 A 

l 






A A CCA 1 f\1 


aldehyde 
dehydrogenase 1 
family, member A 1 


258- 
259 


RBPMS 


343443 


Hs.80248 


W67200; 
W67323 


RNA-binding protein 
gene with multiple 
splicing 


260- 
261 


ADAMT 
SI 


62263 


Hs.8230 


T41173; 
T40309 


a disintegrin-like and 
metalloprotease 
(reprolysin type) with 
thrornbospondin type 

1 mn+if 1 

i mom, i 


262- 
263 


DSCR1L 
1 


51408 


Hs.156007 


H19440; 
H19439 


Down syndrome 
critical region gene 1- 
jute i 


264 


DLK1 


436121 


Hs.169228 


AA701996 


delta-like homolog 
Prosophila) 


265- 
266 


CDH11 


491113 


Hs.75929 


AA136983; 
AA137109 


cadherin 11, type 2, 

OB-cadherin 

(osteoblast) 


138 


SGK 


840776 


Hs.296323 




sgl<?=putative 
serine/threonine 
protein kinase 
transcriptionally 
modified during 
anisotonic and isotonic 
oiiciauuns sjl ueii 
volume 


767 


HFT 1 


4 "50060 


Ho 078Sfi8 




JJl laVLUl 

(complement)-like 1 


169.- 
269 




jOJt / 


He Iflfi^flQ 
na.lUDJU? 


R4Q43Q- 

R35921 


Friend nf HATA9 i 


270 


C7 


898122 


Hs.78065 


AA598478 


complement 

UUIIipUIJCIlL / 


271- 


SGCE 


784109 


Hs.l 10708 


AA432066; 


sarcoglycan, epsilon 


213 


NBL1 


898305 


Hs.76307 


AA598830 


neuroblastoma, 
suppression of 
tumoripenicitv 1 


274- 
275 


HBB 


173385 


Hs.l 55376 


H20968; 
H21011 


hemoglobin, beta 


776- 

278 


PARP 


840683 




XR3703- 
AA488072; 
AA486364 


PvtnVine indnfiiftlp 

nuclear protein 


279- 
280 


MITF 


278570 


Hs.l 66017 


N66177; 
N99168 


microphthalmia- 
associated 
transcription factor 


281- 
282 


CDC20 


898062 


Hs.82906 


U05340; 
AA598776 


p55CDC . 


283- 


EPS8 


148028 


Hs.2132 


U12535; 


epidermal growth 
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285 








H13623; 
H13622 


factor receptor kinase 
substrate (Eps8) 


286- 
287 


ARHI 


345680 


Hs. 194695 


W72033; 
W76278 


ras homolog gene 
family, member I 


288 


B4-2 


857002 


Hs.75969 


AA669637 


proline-rich protein 
with nuclear targeting 
signal 


289- 
291 


SELE 


186132 


Hs.89546 


M30640; 
H39991; 
H39560 


ELAMl=endothelial 
leukocyte adhesion 
molecule I 


292- 
293 


PMP22 


133273 


Hs.103724 


R26732; 
R26960 


peripheral myelin 
protein 22 


294 


EBAF 


340657 


Hs.25195 


W56771 


endometrial bleeding 
associated factor (left- 
right detennination, 
factor A; transforming 
growth factor beta 
superfamily) 


295- 
296 


PRKAR2 
B 


609663 


Hs.77439 


AA1 80007; 
AA181500 


protein kinase, cAMP- 
dependent, regulatory, 
type II, beta 


297 


NFKBIE 


1573311 


Hs.91640 


AA953975 


nuclear factor of kappa 
light polypeptide gene 
enhancer in B-cells 
inhibitor, epsilon 


299 


JSi 1 




IIS. 5 lOO J 


XTO a on A . 

N36279 


v-kit Hardy- 
Zuckerman 4 feline 
sarcoma viral 
oncogene nomoiog 


158 


JUNB 


309864 


Hs.198951 




jun B proto-oncogene 


300 


BCKDK 


1573108 


Hs.20644 


AA970731 


branched chain alpha- 
ketoacid 

dehydrogenase kinase 


301- 
303 


BTG1 


298268 


Hs.77054 


X61123; 
N70463; 
W03824 


BTGl=B-cell 
translocation gene 
l=anti-proliferative 


304- 
305 


AKAP12 


784772 


Hs.788 


AA478543; 
AA478542 


A kinase (PRKA) 
anchor protein 
(gravin) 12 


306- 
307 


NR4A2 


89B221 


Hs.82120 


S77154; 
AA598611 


NOT=Immediate early 

response 

protein=Nurrl 

homologue=Niur77 

orphan steroid 

receptor family 

member 


308- 
309 


HBB 


126531 


Hs.155376 


R06757; 
R06806 


hemoglobin, beta 


310- 
311 


ARHGAP 
6 


768489 


Hs.250830 


AA495981; 
AA425035 


Rho GTPase 
activating protein 6 


312 


PLS3 


1568391 


Hs.4114 


AA953747 


plastin 3 (T isoform) 


313- 
314 


FNTA 


300015 


Hs.138381 


N78902; 
W06970 


farnesyltransferase, 
CAAX box, alpha 


315- 
316 


TNFAEP3 


770670 


Hs.211600 


AA476272; 
AA433807 


tumor necrosis factor, I 
alpha-induced protein 
3 


317- 
318 


EGR1 


840944 


Hs.326035 


AA486533; 
AA486628 


early growth response 
1 


319 


RNAC 


795213 


Hs. 113052 


AA453591 


RNA cyclase homolog 
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320 


PA26 


813584 


Hs.14125 


AA447661 


p53 regulated PA26 
nuclear protein 


321 


Cllorfl3 


1573778 


Hs.72925 


AA970526 


chromosome 1 1 open 
reading frame 13 


322 


ING1L 


2169465 


Hs. 107153 


AI564029 


inhibitor of growth 
family, member 1-like 


323 


RPL9 


2577249 


Hs. 157850 


AW075605 


ribosomal protein L9 


324- 
325 


ADH5 


813711 


Hs.78989 


AA453776; 
AA453859 


alcohol dehydrogenase 
5 (class UT), chi 
polypeptide 


326- 
327 


FZD7 


298122 


Hs. 173859 


N69049; 
W00697 


frizzled (Drosophila) 
homolog 7 


328- 
329 


MATN2 


366100 


Hs. 19368 


AA071473; 
AA082338 


matrilin 2 


330- 
331 


SLC11A3 


71863 


Hs.5944 


T52564; 
T57235 


solute carrier family 
11 frjrotOTi-cfliinlpH 
divalent metal ion 
transporters), member 
3 


332- 
333 


EST 


767641 


Hs.122460 


AA418293; 
AA418356 


ESTs 


334- 
335 


ERCC5 


292463 


Hs.48576 


N62586; 
N80359 


excision repair cross- 
complementing rodent 
repair deficiency, 
complementation 
group 5 (xeroderma 
pigmentosum, 
complementation 
group G (Cockayne 
syndrome)) 


336- 
• 337 


MGC247 
9 


43771 


Hs.79625 


H05655; 
H05654 


hypothetical protein 
MGC2479 


338- • 
339 


RPL21 


810617 


Hs. 184108 


AA464743; 
AA464034 


ribosomal protein L21 


182 


CD24 


204335 


Hs.286124 




CD24 


340- 
341 


SLPI 


378813 


Hs.251754 


X04470; 
AA683520 


Secretory leukocyte 
protease inhibitor 


342 


SPP1 


378461 


Hs.313 


AA775616 


Secreted 
phosphoprotein 1 
(osteopontin, bone. 
sialoprotein I, early T- 
lymphocyte activation 
1) 


343- 
344 


BF 


741977 


Hs.69771 


L15702; 
AA401441 


B-factor, properdin 


345- 
347 


CKS1 


810899 


Hs.334883 


X54941; 
AA459292; 
AA459522 


CDC28 protein kinase 
1 


348- 
349 


MMP7 


470393 


Hs.2256 


AA031514; 
AA031513 


Matrix 

metalloproteinase 7 
(matrilysin) 


350- 
351 


PAX8 


742101 


Hs.73149 


AA405767; 
AA405891 


Paired box gene 8 


352- 
353 


SPINT2 


814378 


Hs.31439 


AA458849; 
AA459039 


Serine protease 
inhibitor, Kunitz type, 
2 


354 


ZWINT 


451907 


Hs.42650 


AA706968 


ZWIOinteractor 


355 


DGKH 


2544675 


Hs.159073 


AW052032 


Diacylglycerol kinase, 
eta 
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356 


HMGIY 


782811 


Hs. 139800 


AA448261 


HMGIY High- 
mobility group 
(nonhistone 
chromosomal) protein 
isofonns I and Y 


357- 
359 


SDC4 


504763 


Hs.252189 


X67016; 
AA148737; 
AA148736 


Syndecan 4 

(amphiglycan, 

ryudocan) 


360 


CDKN2A 


! 1161155 


Hs.1174 


AA877595 


Cyclin-dependent 
kinase inhibitor 2A 
(melanoma, pi 6, 
inhibits CDK4) 


361- 
362 


SCNN1A 


810873 


Hs.2794 


AA458982; 
AA459197 


Sodium channel, 
nonvoltage-gated 1 
alpha 


363 


LDHA 


43550 


Hs.2795 


H05914 


Lactate dehydrogenase 
A 


364- 
365 


FOLR1 


131839 


Hs.73769 


R24530; 
R24635 


Folate receptor 1 


366- 
367 


TPI1 


855749 


Hs.83848 


M10036; 
AA663983 


Triosephosphate 
isomerase 1 


368 


KLK8 


2514426 


Hs.104570 


AI963941 


KallikreinS 
(neuropsin/ovasin) 


200 


CXCR4 


79629 


Hs.89414 




Chemokine (C-X-C 
motif), receptor 4 
(fusin) 


369- 
370 


KNSL1 


825606 


Hs.8878 


AA504625; 
AA504719 


Kinesin-like 1 


371- 
372 


H2AFO 


488964 


Hs.795 


AA047260; 
AA057146 


H2A histone family, 
member O 


373- 
374 


HLA- 
DRA 

• 


153411 


Hs.76807 


R47979; 
R48091 


Major 

histocompatibility 
complex, class II, DR 
alpha 


375 


CRIP1 


1323448 


Hs. 17409 


AA873604 


Cysteine-rich protein 1 


376 


PP 


950700 


Hs. 184011 


AA608572 


pyrophosphatase 
(inorganic) 


377- 
378 


EST 


666391 




AA232895; 
AA232894 




379- 
381 


SLC2A1 


207358 


Hs.l 69902 


-K03195; 
H58873; 
H58872 


Solute carrier family 2 
(facilitated glucose 
transporter), member 1 


382 


EST 


897770 




AA598508 




383- 
385 


HDGF 


813673 


Hs.89525 


D16431; 
AA453749; 
AA453831 


Hepatoma-derived 
growth factor (high- 
mobility group protein 
l-like) 


386 


ASS 


882522 


Hs.l 60786 


AA676466 


Argininosuccinate 
synthetase 


387- 
388 


CLDN4 


770388 


Hs.5372 


AA430665; 
AA427468 


Claudin4 


389 


PRAME 


897956 


Hs.30743 


AA598817 


preferentially 
expressed antigen in 
melanoma 


390- 
391 


PTPRF 


897788 


Hs.75216 


Y00815; 
AA598513 


Protein tyrosine 
phosphatase, receptor 
type,F 


392- 
393 


EYA2 


741139 


Hs.29279 


AA402754; 
AA402207 


Eyes absent 
(Drosophila) homolog 
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2 


394- 
396 


MYCL1 


138917 


Hs.92137 


• M19720; 
R62813; 
R62862 


v-myc 

myelocytomatosis 
viral oncogene 
homologj 


397- 
399 


STAT1 


840691 


Hs.21486 


M97935; 
AA488075; 
AA486367 


Signal transducer and 
activator of 
transcription 1 


400- 
401 


MTCH2 


564492 


Hs.279609 


AA121668; 
AA121740 


mitochondrial carrier 
homolog 2 


402 


HTR3A 


435597 


Hs.2142 


AA703169 


5-hydroxytryptamine 
(serotonin) receptor 
3A 


403- 
404 


CCNE1 


68950 


Hs.9700 


T54121; 
T54213 


CyclinEl 


405 


CDH6 


739155 


Hs.32963 


AA421819 


Cadherin 6, type 2, K- 
cadherin 


406- 
408 


PRKAG1 


531028 


Hs.3136 


U42412; 
AA070495; 
AA070381 


Protein kinase, AMP- 
activated, gamma 1 
non-catalytic subunit 


409 


DEFB1 


2403485 


Hs.32949 


AI769855 


Defensin, beta 1 


410- 
411 


ARPC1B 


626502 


Hs.11538 


AA188179; 
AA188155 


Actin related nrotein 

A LVVU1 X V<4V4 VV\* r 1 1 J 

2/3 complex, subunit 
IB (41 kD) 


412- 
414 


PRKCI 


71622 


Hs.1904 


L33881; 
T57875; 
T57957 


Protein kinase C, iota 


415 


GAPD 


1610448 


Hs.169476 


AA991856 


glyceraldehyde-3- 

phosphate 

dehydrogenase 


416- 
417 


C2 


85497 


Hs.2253 


T71879; 
T71878 


Complement 
Component C2 


418- 
419 


H2AFY 


843075 


Hs.75258 


AA488627; 
AA486003 


H2A histone family, 
member Y 


420- 
421 


TM4SF1 


840567 


Hs.3337 


AA487893; 
AA488005 


Transmembrane 4 
superfamily member 1 


422- 
423 


GAPD 


50117 


Hs.169476 


H16958; 
H16957 


glyceraldehyde-3 - 

phosphate 

dehydrogenase 


424- 
426 


EFITM3 


809910 - 


Hs.182241 


X57352; ■ 
AA464417; 
AA464416 


interferon induced 
transmembrane protein 
3 (1-8U) 


427- 
428 


GLDC 


248261 


Hs.27 


N58494; 
N78083 


Glycine 
dehydrogenase 
(decarboxylating; 
glycine decarboxylase, 
glycine cleavage 
system protein P) 


429- 
430 


CALU 


144881 


Hs.7753 


R78586; 
R78585 


Calumenin 


431- 
432 


HBA2 


208764 


Hs.272572 


H63096; 
H63182 


Hemoglobin, alpha 2 


433 


S100A11 


810612 


Hs.256290 


AA464731 


SI 00 calcium-binding 
protein All 
(calgizzarin) 


434- 
436 


LDHA 


897567 


Hs.2795 


X02152; 
AA497029; 
AA489611 


Lactate dehydrogenase 
A 


437 


UBE2C 769921 


Hs.93002 


AA430504 


Ubio^itin-conjugatinfi 
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enzyme E2C 


438- 
440 


E2F3 


304908 


Hs.1189 


D38550; 
N92519; 
W38841 


E2F transcription 
factor 3 


441- 
442 


CDH1 


251019 


Hs. 194657 


Z13009; 
H97778 


Cadherin 1, type 1, E- 
cadherin (epitheHal) 


443- 
444 


PSME2 


210405 


Hs. 179774 


H65395; 
H65394 


Proteasome (prosome, 
macropain) activator 
subunit2(PA28 beta) 


445- 
447 


BMP7 


344430 


Hs. 170195 


X51801; 
W73473; 
W73527 


bone morphogenetic 
protein 7 (osteogenic 
protein 1) 


448 


TOP2A 


825470 




AA504348 


topoisomerase (DNA) 
H alpha fl70kD) 


449- 
451 


EL8 


328692 


Hs.624 


M17017; 
W45324; 
W40283 


interleukin 8 


452- 
453 


GROl 


324437 


Hs.789 


X54489; 
W46900 


GROl oncogene 
(melanoma growth 
stimulating activity, 
alpha) 


454- 
456 


ALDH1A 
3 


272686 


Hs.75746 


U07919; 
N32289; 
N44575 


aldehyde 
dehydrogenase 1 
family, member A3; 
Aldehyde 
dehydrogenase 6 


457- 
459 


MMP1 


624924 


Hs.83169 


X54925; 
AA181875; 
AA186634 


matrix 

metalloproteinase 1 

(interstitial 

collagenase) 


460- 
461 


OSF-2 


897910 


Hs.136348 


D13666; 
AA598653 


osteoblast specific 
factor 2 (fasciclin I- 
like) 


462- 
464 


CDC25B 


48398 


Hs.153752 


S78187; 
H14343; 
H14392 


cell division cycle 
25B; M-phase inducer 
phosphatase 2 


465- 
467 


FLNA 


487418 


Hs. 195464 


X53416; 
AA046721; 
AA046606 


filamin A, alpha 
(actin-binding protein- 
280) 


ACQ 

468- 
469 


TFP12 


726086 


Hs.295944 


AA399473; 
AA293402 


tissue. factor pathway 
inhibitor 2 


470- 
472 


FGF2 


23073 


Hs.284244 


M27968; 
R38539; 
T75110 


fibroblast growth 
factor 2 (basic) 


473- 
475 


CD44 


713145 


Hs.169610 


X56794; 
AA283090; 
AA282906 


CD44 antigen; 
extracellular matrix 
receptor- 
III=Hyaluronate 
receptor 


476- 
477 


DYT1 


69046 


Hs.19261 


T54320; 
T53726 


dystonia 1, torsion 
(autosomal dominant; 
torsin A) 


478 


UCHL1 


878833 


Hs.76118 


AA670438 


ubiquitin carboxyl- 
terminal esterase LI 
(ubiquitin 
thiolesterase) 


479- 
480 


PLAU 


1696513 


Hs.77274 


D00244; 
AI088434 


plasminogen activator, 
urokinase 


256 


LDHA 


43550 


Hs,2795 




lactate dehydrogenase 
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A 


481- 
483 


PTGS2 


147050 


Hs.196384 


U04636; 
R80217; 
R80322 


cyclooxygenase-2 ; 
prostaglandin 
endoperoxide 
synthase-2 


484- 
486 


PRNP 


682013 


Hs.74621 


M13899; 
AA256322; 
AA256449 


prion protein 


487- 
488 


MT1X 


297392 


Hs.278462 


N80129; 
W03653 


metallothionein 1L, 
metallothionein IX 


489- 
490 


UGB 


81336 


Hs.2240 


T63761; 
T63800 


uteroglobin 


491- 
493 


PBEF 


594539 


Hs.239138 


U02020; 
AA169813; 
AA171651 


pre-B-cell colony- 
enhancing factor 


494- 
496 


TXNRD1 


789376 


Hs. 13046 


X91247; 
AA464849; 
AA453335 


thioredoxin reductase 
1;GRIM-12 


497- 
499 


NTS 


21655 


Hs. 153952 


X55740; 
T65120; 
T65189 


5' nucleotidase (CD73) 


500- 
502 


MT2A 


590150 


Hs.l 18786 


J00271; 
AA156031; 
AA156201 


metallothionein 2A 


503 


ZNF220 


949928 


Hs.82210 


AA599173 


zinc finger protein 220 


504- 
506 


AKT1 


810331 


Hs.71816 


U97276; 
AA464152; 
AA464217 


BPGF-l=bone-derived 
growth factor; v-akt 
murine thymoma viral 
oncogene homolog 1 


507- 
509 


PTEN 


322160 


Hs.10712 


U92436; 
W37864; 
W37855 


MMAC 1 =PTEN=Tum 
or suppressor gene at 
10q23.3 that is 
Mutated in Multq>le 
Advanced 

Cancers=Phosphatase 
and tensin homolog 


510- 
512 


UBL1 


758495 


Hs.81424 


U83117; 
AA401634; 
AA401864 


ubiquitin-homology 
domain protein PIC1 


513- 
514 


WNT2 


302286 


Hs.89791. 


N78828; 
W17194 


.. wingless-type MMTV . 
integration site family 
member 2 


515- 
517 


SFRP4 


841282 


Hs.105700 


AF026692; 
AA487193; 
AA486838 


frizzled related protein 
frpHE 


518- 
520 


RUNX1 


263251 


Hs.129914 


D43968; 
H99599; 
H99598 


AML1 Prpto- 
oncogene 


521- 
523 


TALI 


71727 


Hs.73828 


X51990; 
T51236; 
T51350 


T-ceil acute 
lymphocytic leukemia 
1 


524- 
526 


WAS 


236282 


Hs.2157 


U12707; 
H61193; 
H62098 


Wiskott-Aldrich 
syndrome protein 


527- 
528 


PCTK1 


713382 


Hs.171834 


X66363; 
AA283125 


PCTAIRE 1 
serine/threonine 
protein kinase 


529 


EBP 


295986 


Hs.75105 


N67038 


emopamil-binding 
protein (sterol 
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isomerase) 


530 


SMC1L1 


897997 


Hs.211602 


AA598887 


SMC1 (structural 
maintenance of 
chromosomes 1, 
yeast)-like 1 


531- 
532 


AR API 


ZU/OIO 


HS.77183 


H59758; 
H59757 


v-raf murine sarcoma 
3611 viral oncogene 
homolog 1 | 


533 


UEE1 


898262 


Hs.2055 


AA598670 


ubiquitin-activating ! 
enzyme El 


534- 
535 


LOC5176 
0 


52226 


Hs.26971 


H23265; 
H23376 


B/K protein 


536- 
537 


LRPAPl 


842785 


Hs.75140 


AA486209; 
AA486313 


low density 

lipoprotein-related 

protein-associated 

protein 1 (alpha-2- 

macroglobulin 

receptor-associated 

protein 1) 


538- 
540 


PSXPIP1 


71434 


Hs. 129758 


JL07633; 
T47815; 
T47814 


interferon-gamma IEF 
SSP 5111; Interferon 
gamma upregulated 
protein j 


541- 
542 


EDH2 


869375 


Hs.5337 


X69433; 
AA679907 


isocitrate 1 
dehydrogenase 2 j 
(NADP+), 

mitochondrial | 


543 


SIAHBP1 


85469.6 


Hs.74562 


AA630094 


fuse-binding protein- f 
interacting repressor i 


544 


SLC25A1 
1 


878413 


Hs. 184877 


AA670357 


solute carrier family j 
25 (mitochondrial I 
carrier; oxoglutarate f 
carrier), member 11 1 


545- 
547 


LCN2 


302127 


Hs.204238 


X99133; 
N79823; 
W38398 


hpocalin 2 (oncogene j 
24p3) 


548- 
550 


TCEB2 


52162 


Hs.172772 


L42856; 
H22966; 
H24146 


Elongin B=RNA 
polymerase II 
transcription factor 
SHIpl8subunit 


551- 
553 


NM23H1 


176482 




X17620; 

tja icon 

H42520; 
H43520 


nm23-Hl=NDP kinase 
A^Nucleoside 
dephophate kinase A 


554- 
556 


SCYB5 


198699 


Hs.89714 


X78686; 
RP5077; 


ENA78=chemokine | 


557- 
558 


PAK2 


231951 


Hs.284275 


U25975; 
H92785 


hPAK65=SER/THR- ! 
protein kinase PAK- 

gamma =P21-activated 
kinase 3 


559- 
561 


S100A4 


472180 


Hs.81256 


M80563; 

AAUD / j o; 

AA036758 


S 1 00 calcium binding 
protein A4=Placental 
calcium binding ! 
protein=Calvasculin= 
mtsl 1 
PROTEIN=CAPL 1 


562- 
564 


CD83 


564503 


Hs.79197 


Z11697; 
AA101749; 
AA101748 


CD83=B-G antigen ] 
[gV domain 
iomolog=B-cell 
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activation 
protein=HB15 


565- 
567 


NCOA1 


609445 


Hs.74002 


U59302; 
AA180462; 
AA179970 


SRC-l=steroid 
receptor coactivator 


568- 
570 


RYBP 


649654 


Hs.7910 


AF179286; 
AA216739; 
AA216519 


Death effector 
domain-associated 
factor=Binds to 
Caspase 10DED 
domain=Homolog of 
mouse RYBP 
repressor protein that 
interacts with 
Polycomb complex 
and YY1=YAF2 
homolog=DEDAF=Y 
AF2 

homolog=MLNewGen 
e3 


571- 
573 


ITGAE 


665279 


Hs.851 


L25851; 
AA195282; 
AA195146 


CD 103 alpha=Integrin 
alpha-E ' 


574- 
576 


IL7 


701422 


Hs.72927 


J04156; 
AA287945; 
AA288010 


DL-7 


577- 
579 


CD36 


243816 


Hs.75613 


M98399; 
N39161; 
N45238 


CD36 


580- 
582 


PDGFRB 


773439 


Hs.76144 


J03278; 
AA426020; 
AA428115 


Platelet-derived 
growth factor receptor, 
beta 

polypeptide=fused to 
TEL in 

t(5;12)(q33;pl3) 
chronic 

myelomonocytic 
leukemia 


583- 
584 


IL17R 


842122 


Hs.129751 


U58917; 
AA634809 


IL-17 receptor 


585- 
586 


HGF 


1219612 


Hs.809 


X16323; 
AA687773 


Hepatocyte growth . 
factor (hepapoietin A; 
scatter factor) 


587- 
588 


BAD 


1286754 


Hs.76366 


U66879; 
AA740876 


BAD=bbc6=proapopto 
tic Bcl-2 homolog 


589- 
591 


ZNF173 


755176 


Hs.1287 


U09825; 
AA421953; 
AA421952 


acid finger protein 


592- 
593 


ZFP161 


285742 


Hs.156000 


D89859; 
N64141 


ZF5=POZ domain 
zinc finger protein 


594- 
596 


RGS16 


470132 


Hs.183601 


U70426; 
AA029960; 
AA029959 


A28-RGS14p=G 
protein signaling 
regulator 


597- 
599 


PPP1CB 


485729 


Hs.21537 


X80910; 
AA040285; 
AA040284 


PPPlCB=Protein 
phosphatase 1, 
catalytic subunit, beta 
isoform 


600- 
602 


GART 


502761 


Hs.82285 


X54199; 
AA126256; 
AA126360 


Phosphoribosylglycina 
mide 

fonnyltransferase, 
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phospho- 

ribosylglycinaniide 
synthetase, 

phosphoribosylaminoi 
midazole synthetase 


603- 
605 


ENPP1 


786041 


Hs.11951 


D12485; 
AA448639- 
AA448731 


PC-1 = alkaline 

nuHpotidf* 

UUvlwV UUb 

pyrophosphatase 


606- 
608 


MMP13 


786029 


Hs.2936 


X75308; 
AA448634- 
AA448726 


MMP-13=Matrix 

iiiutanuprotciUd.bc 

1 3=CL-3^Collagenase 
3 


609- 
611 


ILK 


292313 


Hs.6196 


U40282; 
N62542; 
N79210 


ILKHntegrin-linked 
kinase 


612- 
614 


SCYA4 


205633 


Hs.75703 


J04130* 
H62864; 
H62985 


MIP-1 

beta=SCAY2=G- 

26=HC21=pAT 

744=LAG-l=Act- 

2=H400=SIS- 

gamma=chemokine 


615- 
617 


KDR 


469345 


Hs. 12337 


AF035121; 
AA027012; 
AA026831 


Kinase insert domain 
receptor (a type III 
receptor tyrosine 
kinase) 


618- 
620 


1L18R1 


755054 


Hs.159301 


U43672; 
AI821652; 
AI734039 


EL- 18 receptor 1=IL- 
lRrp 


621- 
623 


PPP2R5A 


41356 


Hs.155079 


L42373; 
R59165; 
R59164 


phosphatase 2A B56- 
alpha(PP2A) 


624- 
625 


PTK2B 


180298 


Hs.20313 


U43522; 
R85257 


protein tyrosine kinase 
PYK2 


626- 
628 


MAP2K3 


45641 


Hs.180533 


D87116; 
H08749; 
H08467 


Dual specificity 
mitogen-activated 
protein kinase kinase 3 


629- 
631 


TNFR2R 
P 


124034 


Hs. 117847 


L04270; 
R02558; 
R02676 


Lymphotoxin-Beta 
receptor precursor = 
Tumor necrosis factor 
receptor 2 related 
protein = Tumor 
necrosis factor C 
receptor 


632- 
633 


EST 


739852 


Hs.328687 


AI821550; 
AA477842 


ESTs, Moderately 
similar to 

ALU4_HUMAN ALU 
subfamily SB2 
sequence 

contamination warning 
entry [H.sapiens] 


634- 
635 


EST 


1862171 


Hs.310541 


AI053777; 
AI792563 


ESTs 


636 


EST 


1985026 




AI251605 


Unknown 


637 


EST 


2002071 


Hs.203960 


AI249848 


ESTs 


638 


EST ! 


2047317 




AI3 11297 


Unknown 


639 


EST 


2215752 




AI567814 


Unknown 


640 


EST 


2217459 




AI744181 


Unknown 


641 


EST 


2217834 


Hs.328451 


AI744330 


EST, Weakly similar 
to PRPP HUMAN 
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salivary proline-rich 
protein II- 1 
[H.sapiens] 


642 


EST 


2219300 




AI745684 


Unknown 


643 


EST 


2220085 


Hs337231 


AI798317 


EST 


644 


EST 


2220214 




AI798385 


Unknown 


645 


EST 


2261174 


Hs.185554 


AI609326 


EST 


646 


EST 


2261169 




AI609331 


Unknown 


647 


EST 


2292810 


Hs.224732 


AI871658 


EST 


648 


EST 


2292831 


Hs.337311 


AI871678 


EST 


649 


EST 


2549950 




AI954130 


Unknown 


650 


EST 


2550130 




AI953438 


Unknown 


651- 
652 


FDFT1 


25725 


Hs.48876 


R36960; 
R11842 


farnesyl-diphosphate 
famesyltransferase 1 


653- 
654 


NAGA 


28985 


Hs.75372 


R40255; 
R14305 


N- 

acetylgalactosaminida 
se, alpha- 


655- 
656 


SECRET 


29054 


Hs.l 16428 


R40850; 
R14422 


secretagogin 


657- 
658 


SLC9A1 


30272 


Hs.170222 


R42414; 
R14692 


solute carrier family 9 
(sodium/hydrogen 
exchanger), isoform 1 
(antiporter, Na+/H+, 
amiloride sensitive) 


659 


TUFM 


34945 


Hs.12084 


R45183 


Tu translation 
elongation factor, 
mitochondrial 


660- 
661 


MNAT1 


38471 


Hs.82380 


R49475; 
R35961 


menage a trois 1 
• (CAK assembly 
factor) 


662- 
663 


HARS 


43021 


Hs.77798 


R60150; 
R60149 


histidyl-tRNA 
synthetase 


664- 
665 


EIF4A1 


46171 


Hs.129673 


H09590; 
H09589 


eukaryotic translation 
initiation factor 4A, 
isoform 1 


666- 
667 


MPI 


50359 


Hs.75694 


H17096; 
H17714 


mannose phosphate 
isomerase 


668- 
669 


TAGLN2 


45544 


Hs.75725 


H08564; 
H08563 


transgelin 2 


670- 
671 


PON1 


128143 


Hs.1898 


R12373; 
R09781 


paraoxonase 1 


672- 
673 


EST 


813444 


Hs.l 78379 


AA455945; 
AA455554 


ESTs 


674- 
675 


GNB2 


292213 


Hs.91299 


N68166; 
N80625 


guanine nucleotide 
binding protein (G 
protein), beta 
polypeptide 2 


676- 
677 


KIAA036 
5 


811029 


Hs.190452 


AA485383; 
AA485539 


KIAA0365 gene 
product 


678 


NCSTN 


199645 


Hs.4788 


R96527 


nicastrin 


679- 
681 


ARHGEF 
6 


687990 


Hs.79307 


D25304; 
AA236957; 
AA236617 


KIAA0006 


682 


SFRS11 


204755 


Hs. 11482 


H56944 


splicing factor, 
arginine/serine-rich 11 


683- 
684 


CUGBP1 


25588 


Hs.81248 


R15111; 
R12181 


CUG triplet repeat, 
RNA-binding protein 
1 


685- 


GABRP 


563598 


Hs.70725 | AA101225; 


gamma-aminobutyric 
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686 








AA102670 


acid (GABA) A 
receptor, pi * 


687- 
688 


BMP 6 


768168 


Hs 285671 


AA424833- 
AA426586 


bone morphogenetic 
protein 6 


689- 
690 


ATP7A 


687820 


Hs.606 


AA236141; 
AA236635 


ATPase, Cu-H- 
transporting, alpha 
polypeptide (Menkes 
syndrome) 


691- 
692 


RBBP4 


773599 


Hs. 16003 


AA428365; 
AA429422 


retinoblastoraa- 
binding protein 4 


693- 
694 


POLR2A 


740130 


Hs.171880 


AA479052; 
AA477535 


polymerase (RNA) II 
(DNA directed) 
polypeptide A 
(220kD) 


695- 
696 


SMG1 


785605 


Hs.110613 


AA449463; 
AA448998 


PI-3-kinase-related 

kinase 

SMG-1 


697- 
698 


GTPBP1 


826217 


Hs.283677 


U87964; 
AA521469 


GP-l=putative G- 
protein 


699- 
700 


GS2NA 


767994 


Hs.183105 


AA418821; 
AA418918 


nuclear autoantigen 


701- 
702 


FLJ12442 


32231 


Hs.84753 


R42815; 
R17469 


hypothetical protein 
FU12442 


703- 
704 


KIAA021 
8 


49404 


Hs.75863 


HI 5567; 
H15627 


KIAA0218gene 
product 


705- 
706 


KIAA014 
4 


245015 


Hs.8127 


N52646; 
N72374 


KIAA0144 gene 
product 


707- 
708 


FOX01A 


628955 


Hs.170133 


AA194765; 
AA194764 


forkhead boxOlA 
(rhabdomyosarcoma) 


709 


CSRP2 


75254 


Hs. 10526 


T59334 


cysteine and glycine- 
rich protein 2 


710- 
711 


BRE 


739993 


Hs.80426 


AA479741; 
AA477082 


brain and reproductive 
organ-expressed 
(TNFRSF1A 
modulator) 


712- 
713 


RALY 


825583 


Hs.74111 


AA504617; 
AA504712 


RNA-binding protein 
(autoantigenic) 


714- 
716 


FGFR2 


809464 


Hs.282823 


M87771; 
AA443093; 
AA456160 


FGFR2=Fibroblast 
growth factor receptor 
2 


717- 
718 


EST 


242820 


Hs.290870 


H94050; 
H94131 


ESTs, Weakly similar 
to 138588 reverse 
transcriptase homolog 
[H.sapiens] 


719- 
720 


PEF 


137353 


Hs.241531 


R38031; 
R38117 


PEF protein with a 
long N-terminal 
hydrophobic domain 
(peflin) 


721- 
722 


EST 


265494 


Hs.153445 


N21309; 
N31244 


Human mRN A for 
unknown product, 
partial cds 


723- 
724 


SAST 


739625 


Hs.227489 


AA479623; 
AA477008 


syntrophin associated 
serine/threonine kinase 


725- 
726 


EST 


142499 




R70037; 
R70084 


Unknown 


727- 
728 


PLXNA2 


303035 


Hs.300622 


N91580; 
W19130 


plexin A2 


729- 
730 


EST 


240694 


Hs. 167787 


H78135; 
H78134 


ESTs 
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731- 
732 


APMCF1 


198904 


Hs.12152 


R95693; 
R95692 


APMCF1 protein 


375 


CALU 


144881 


Hs.7753 




calumenin 


733- 
734 


PPY2 


210873 


Hs.20588 


H67736; 
H66312 


pancreatic polypeptide 
2 


735- 
736 


CRB1 


248485 


Hs. 169745 


N59646; 
N78199 


crumbs (Drosophila) 
homolog 1 


737- 
738 


FLJ21661 


80095 


Hs.334718 


T63321; 
T63940 


hypothetical protein 
FLJ21661 


739- 
740 


RAB3A 


163579 


Hs.27744 


H14231; 
H14230 


RAB3A, member RAS 
oncogene family 


741- 
742 


GCAT 


307094 


Hs.54609 


N93695; 
W21033 


glycine C- 
acetyltransferase (2- 
amino-3-ketobutyrate 
coenzyme A ligase) 


743- 
744 


P14L 


809437 


Hs. 178576 


AA458464; 
AA442976 


similar to Bos taurus 
P 14 protein 


745- 
746 


KIAA00O 
8 


357373 


Hs.77695 


W93717; 
W93568 


KIAA0008 gene 
product 


747- 
748 


LOX 


341680 


Hs.102267 


W60414; 
W60413 


lysyl oxidase 


749- 
750 


PISD 


343609 


Hs.8128 


W69460; 
W69544 


phosphatidylserine 
decarboxylase 


751- 
752 


EST 


341834 


Hs.27278 


W60647; 
W60905 


ESTs, Weakly similar 
to A Chain A, 
Cyclophilin A 
[H.sapiens] 


753- 
754 


EST 


809490 


Hs.3737 


AA443117; 
AA456181 


ESTs 


755- 
756 


TMEPAI 


809824 


Hs.83883 


AA455519; 
AA464401 


transmembrane, 
prostate androgen 
induced RNA 


757- 
758 


ZNF211 


346947 


Hs.15110 


W79396; 
W79316 


zinc finger protein 211 


759 


LOC5160 
5 


810343 


Hs.128791 


AA464166 


CGI-09 protein 


760- 
761 


MAPRE1 


428223 


Hs.234279 


AA001749; 
AA001819 


microtubule- 
associated protein, 
RP/EB family, 
member 1 


762 


FLJ10701 


430068 


Hs. 146589 


AA009830 


hypothetical protein 
FLJ10701 


763- 
764 


DKFZP56 
4C186 


366353 


Hs. 134200 


AA026278; 
AA026277 


DKFZP564C186 
protein 


765 


EST 


810205 


Hs.264606 


AA464518 


ESTs 


766- 
767 


F23149J 


428507 


Hs.152894 


AA004525; 
AA004607 


hypothetical protein 
F23149 1 


768- 
769 


FLJ21940 


810795 


Hs. 1049 16 


AA458876; 
AA459066 


hypothetical protein 
FLJ21940 


770- 
771 


FLJ22059 


292223 


Hs. 13323 


N62464; 
N79183 


hypothetical protein 
FLJ22059 


772- 
773 


EST 


241861 


Hs.269020 


H93115; 
H93243 


ESTs 


398 


RGS1 


361323 


Hs.75256 




regulator of G-protein 
signalling 1 


774 


PDE6A 


361840 


Hs.182240 


W92514 


phosphodiesterase 6A, 
cGMP-specific, rod, 
alpha 


775- 


IL1B 


491763 


Hs. 126256 


AA150507; 


interleukin 1, beta 
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1 776 








AA156711 




777 


SF3B4 


432564 


Hs.25797 


AA699361 


splicing factor 3b, 
subunit 4, 49kD 


778 


JbST 


277627 


Hs.348427 


N45979 


Human SH3 domain- 
containing protein 
SH3P18mRNA, 
complete cds 


770 


! i y^rn 


1 OD4581 


Hs.326198 


AA669136 


transcription factor 4 


780- 
781 


RAB2L 


741891 


Hs. 170160 


AA401972; 
AA402117 


RAB2, member RAS 
oncogene family-like 


782- 


GOLOA1 


| 34102 


Hs.172647 


R44140; 
R23687 


golgi autoantigen, 
golgin subfamily a, 1 


784 


TNRC12 


770000 


Hs.306094 


AA427519 


trinucleotide repeat 
containing 12 


785 


CSNK1E 


854138 


Hs.79658 


AA669272 


casein kinase 1, 
epsilon 


786- 
787 


| AFP 


j 74537 


Hs. 155421 


T59043; 
T59118 


alpha-fetoprotein 


788- 
789 


PfWA 1 
j wVAl 


| jooozz 


TT- 1 f f 1 O £• 

Hs. 155185 


AA156560; 
AA157732 


cytosolic ovarian 
carcinoma antigen 1 


790- 
791 


J nTCA 




Hs.73722 


AA478273; 
AA478331 


APEX nuclease 
(multifunctional DNA 
repair enzyme) 


792- 
793 


RBBP2 


841655 


Hs.76272 


AA487492; 
AA487706 


retinoblastoma- 
bindingjprotein 2 


794- 
796 


j GR01 


323238 




M36820; 
W42723; 
W42812 


Human cytokine 
(GRO-beta)mRNA; 
GR02=GRObeta = 
MIP2 alpha = 
macrophage 
inflammatory protein- 
2 alpha = chemokine 


797- 
799 


WNT2 j 


149373 




X07876; 
H04382; 
H04408 


wingless-type MMTV 
integration site family 
member 2 


800 


PTGS2 


845477 


Hs.196384 


AA644211 


cyclooxygenase-2; 
prostaglandin 
endoperoxide 
synthase-2 


5U1- 


T>T>XTD 1 


470074 


Hs.74621 


AA029059; 
AA029163 


prion protein 


803- 
804 


WNT5B 


323636 


Hs.306051 


W44518; 
W44517 


Homo sapiens mRNA 
for WNT5B, complete 
cds 


805 


CD72 


1241854 




AA7 14696 




806 




897774 




AA598510 


adenine 

phosphoribosyltransfer 
ase 


808 




795893 




AA460168 
AA460768 


protein phosphatase 1, 
regulatory (inhibitor) 
subunit 15A 


809- 
810 








A A Cf\A 111 

AA504113 
AA504371 


M-phase 

phosphoprotein 10 


811- 
812 




154720 




R55220 
R55219 


ARD1 homolog, N- 
acetvltransfera^p (S\ 
cerevisiae) 


813- 
814 




204214 




H59204 
H59203 < 
< 


CDC6 cell division 
sycle 6 homolog (S. 
:erevisiae) 
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815- 
816 




815294 




AA481547 
AA481613 


protein tyrosine 
phosphatase, receptor 
type, C-associated 
protein 


817 




825265 




AA504204 


polymerase (DNA 
directed), delta 3 


818 




2549467 




AI952542 


unknown EST 


819 




1056107 




AA628360 


putative cyclin Gl 
interacting protein 


820- 
821 




809515 




AA454565 
AA456458 


pLK=homologue of 
Drosophila polo 
serine/threonine kinase 


822 


CMKBR6 






U45984 


CCR6=STRL22=chem 
okine receptor for 
MDP-3 

alpha/LARC/Exodus 
on activated B cells 
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Table 4. Markers that were Under-expressed in Ovarian Cancer in a 
Comparison of Ovarian Epithelial Cancer to Normal Postmenopausal Ovarian 
Tissue 



SEQ. ID. 
NO. 


IMAGE ED 


Nucleic 
Acid 


Description 


Average 

log 
normal 


Average 

log 
cancer 


Cancer 

to 
normal 


202 


878596 


ITM2A 


integral membrane protein 2A 


1.145 


-2.036 


0.110 


203-204 


42558 


GATM 


glycine amidinotransferase (L- 

arginine:glycine 

amidinotransferase) 


4.137 


0.945 


0.109 


205-207 


81417 


KNASE4 


ribomiclease L (2',5'- 
oligoisoadenylate synthetase- 
dependent) 


2.057 


-0.744 


0.144 


208-210 


471642 


LAMA2 


laminin alpha 2 (merosin, 
congenital muscular dystrophy) 


2.806 


0.361 


0.184 


211 


448386 


PBX3 


pre-B-cell leukemia 
transcription factor 3 


2.354 


-0.243 


0.165 


212 


1472538 


PLA2G6 


phospholipase A2, group VI 
(cytosolic, calcium-independent) 


2.604 


0.099 


0.176 


213 


814636 


SMARCA2 


SWI/SNF related, matrix 
associated, actin dependent 
regulator of chromatin, 
subfamily a, member 2 


3.055 


0.231 


0.141 


214-215 


488956 


CUGBP2 


CUG triplet repeat, RNA- 
binding protein 2 


2.960 


-0.043 


0.125 


216-218 


209655 


TGFBR3 


TGF beta receptor type III 


1.956 


0.057 


0.268 


219 


859858 


STAR 


steroidogenic acute regulatory 
protein 


1.685 


0.026 


0.317 


220 


1636447 


GNG11 


guanine nucleotide binding 
protein 1 1 


1.683 


-0.953 


0.161 


221 


491565 


CITED2 


Cbp/p300-interacting 
transactivator, with Glu/Asp- 
rich carboxy-terminal domain, 2 


1.576 


-0.497 


0.238 


222 


744647 


CTNNAL1 


catenin (cadherin-associated 
protein), alpha-like 1 


1.498 


-0.761 


0.209 


223 


743773 


ABCA8 


ATP-binding cassette, sub- 
family A (ABC1), member 8 


2.317 


0.060 


0.209 


224-226 


188232 


KLF4 " 


GKLF=EZF=KLF4=gut- 
enriched Kruppel-like zinc 
finger protein=expressed in 
vdisL/iudr cnaoLQciiai ecus 


1.644 


-0.741 ' 


0.191 


227-228 


471725 


ITPR1 


inositol 1,4,5-triphosphate 
reeentor £vr»e 1 


1.600 


-0.616 


0.215 


229-230 


487793 


MAF 


v-maf musculoaponeurotic 
fibrosarcoma (avian) oncogene 
lomolog 


0.765 


-2.032 


0.144 


231-232 


768370 


FOXC1 


forkhead box CI 


2.270 


-0.021 


0.204 


233 


461351 


TCF21 


transcription factor 21 


1.733 


0.193 


0.344 


234-236 


248295 


CCNI 


Cyclin I 


2.460 


-0.030 


0.178 


237-238 


209367 


DCN 


iecorin 


3.762 


0.582 


0.110 


239-240 


789049 


CBF2 

1 


CCAAT-box-binding 
transcription factor 


2.140 


0.019 


0.230 
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241-242 


68049 




. Homo sapiens, clone 
IMAGE:4183312,mRNA, 
partial cds 


2.074 


0.243 


0.281 


122 


81417 


RNASE4 


ribonuclease L (2',5 - 
oligoisoadenylate synthetase- 
dependent) 


1.696 


-0.117 


0.285 


243 


2094012 


SLC4A1AP 


solute carrier family 4 (anion 
exchanger), member 1, adapter 
protein 


1.869 


0.195 


0.313 


244-245 


377731 


GSTM5 


glutathione S-transferase M5 


1.558 


0.242 


0,402 


246 


460470 


C4BPB 


complement component 4- 
binding protein, beta 


0.750 


-0.851 


0.330 


247-248 


73609 


HS3ST1 


heparan sulfate (glucosamine) 3- 
O-sulfotransferase 1 


■ 2.017 


0.328 


0.310 


249 


2413955 


CDKN1C 


cyclin-dependent kinase 
inhibitor lC(p57,Kip2) 


2.739 


0.465 


0.207 


250 


897823 


HNRPDL 


heterogeneous nuclear 
ribonucleoprotein D-like 


1.703 


-0.229 


0.262 


251 


1558799 


CERBP 


cold inducible RNA-binding 
protein 


1.817 


-0.183 


0.250 


252 


2321596 


RGS2 


regulator of G-protein signalling 
2, 24kD 


1.607 


-0.290 


0.269 


253-254 


786607 


TCEAL1 


transcription elongation factor A 
{SII)-likel 


1.737 


0.083 


0.318 


255-256 


377461 


CAV1 


caveolin 1, caveolae protein, 
22kD 


0.146 


-2.506 


0.159 


257 


855624 


ALDH1A1 


aldehyde dehydrogenase 1 
family, member Al 


2.030 


-0.068 


0.233 


258-259 


343443 


RBPMS 


RNA-binding protein gene with 
multiple splicing 


1.727 


-0.240 


0.256 


260-261 


62263 


ADAMTS1 


a disintegrin-Iike and 
metalloprotease (reprolysin 
type) with thrombospondin type 
1 motif, 1 


•1.589 


0.029 


0.339 


262-263 


51408 


DSCR1L1 


Down syndrome critical region 
gene 1-like 1 


2.054 


0.260 


0.288 


264 


436121 


DLK1 


delta-like homolog (Drosophila) 


0.307 


-1.944 


0.210 


265-266 


491113 


CDH11 


cadherin 1 1 , type 2, OB- 
cadherin (osteoblast) 


2.308 


0.502 


0.286 


139 • 


840776 


SGK 


sgk=putative serine/threonine 
protein kinase transcriptionally 
modified during anisotonic and 
isotonic alterations of cell 

VvPJUXllG 


1.444 


-0.201 


0.320 - 


267 


450060 


HFL1 


rl factor (complement)-like 1 


2.490 


0.664 


0.282 








r nena or u A 1 A2 


1.905 


0.463 


0.368 


970 


RQR199 




complement component 7 


2.643 


0.660 


0.253 


271-272 


784109 


SGCE i 


sarcoglycan, epsilon 


1.573 


-0.213 


0.290 


273 


898305 


NBL1 i 
t 


leuroblastoma, suppression of 
umorigenicity 1 


1.910 


0.153 


0.296 


274-275 


173385 


HBB 1 


lemoglobin, beta 


0.165 


-2.317 


0.179 


276-278 


840683 


CARP ( 
I 


Cytokine inducible nuclear 
jrotein 


1.728 


0.018 


0.306 


279-280 


Z» r OJ / V/ 


jlvjul XT 1 

t 


lUClopnulaliJlla-aSSOCiatCQ 

ranscription factor 


ft O/IO 


1 CO A 


0.283 


281-282 


898062 


CDC20 j 


)55CDC 


1.679 


-0.043 


0.303 


283-285 


148028 


EPS8 


pidermal growth factor receptor 
ioase substrate (Eps8) 


1.595 


-0.344 


0.261 
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986-287 


JtJQOV 


ARHT 


rac fiAmfilnP' irpnp fflTTiilv 
Lao liuillUJVg gvuc laiJLLuy, 

member I 


1 595 


0 219 


0 383 

UJOJ 


288 


857002 


B4-2 


proline-rich protein with nuclear 
targeting signal 


1.523 


0.285 


0.424 


289-291 


186132 


SELE 


ELAMl=endothelial leukocyte 
adhesion molecule I 


1.480 


-0.393 


0.273 


292-293 


133273 


PMP22 


peripheral myelin protein 22 


1.767 


0.017 


0.297 


294 


340657 


EBAF 


endometrial bleeding associated 

fhrtot* npft-Htrht" ffptprminatirm 

factor A; transforming growth 
factor beta superfamily) 


1.422 


-0.076 


0.354 




UV7UUJ 


PRKAR2B 


nrAtpin 1rinfl<sp pAT^/TP- 

ylUlvlli lUllOOb) wjTVLViJL 

dependent, regulatory, type II, 
beta 


1 357 


-0 461 


0 284 


9Q7 


1 ^7^1 1 




nuuicax iauiur ui Kappa iigni 

polypeptide gene enhancer in B- 
cells inhibitor, epsilon 


1 949 


-1 109 


0 1 07 


298-299 


269806 


KIT 


v-kit Hardy-Zuckerman 4 feline 
sarcoma viral oncogene 
homolog 


2.324 


0.315 


0.248 


158 


309864 


JUNB 


jun B proto-oncogene 


3.465 


1.117 


0.196 


300 


1573108 


BCKDK 


branched chain alpha-ketoacid 
dehydrogenase kinase 


1.306 


-1.05? 


0.194 


301-303 


298268 


BTG1 


BTGl=B-cell translocation gene 
l=anti-proliferative 


1.409 


-0.239 


0.319 


304-305 


784772 


AKAP19 


A kinase fPRKA^ anchor nrotein 
(eravin) 12 


1.568 


-0,413 


0.253 


306-307 


898221 


NR4A2 


NOT=Immediate early response 
protein=Nurrl 
homologue=Nurr77 orphan 
steroid receptor family member 


2.092 


0.341 


0.297 


308-309 


126531 


HBB 


hemoglobin, beta 


0.190 


-2.034 


0.214 


310-311 


768489 


ARHGAP6 


Rho GTPase activating protein 6 


1.263 


-0.109 


0.386 


312 


1568391 


PLS3 


plastin 3 (T isoform) 


1.241 


-1.045 


0.205 


313-314 


300015 


FNTA 


farnesyltransferase, CAAX box, 
alpha 


1.354 


0.091 


0.417 


315-316 


770670 


TNFAIP3 


tumor necrosis factor, alpha- 
induced protein 3 


2.088 


0.445 


0.320 


317-318 


840944 


EGR1 


early growth response 1 


3.245 


0.765 


0.179 


319 


795213 


RNAC 


RNA cyclase homolog 


1.812 


0.043 


0.293 


320 


813584 


PA26 


n53 reenlated PA26 nuclear 
protein 


1.329 


0.084 


0.422 


321 


1573778 


Cllorfl3 


chromosome 1 1 open reading 
frame 13 


1.241 


-1.048 


0.205 


322 


2169465 


ING1L 


inhibitor of growth family, 
member 1-like 


1.267 


0.031 


0.425 


323 


2577249 


RPL9 


ribosomal protein L9 


1.756 


-0.313 


0.238 


324-325 


813711 


ADH5 


alcohol dehydrogenase 5 (class 
HI), chi polypeptide 


1.211 


-0.616 


0.282 


326-327 


298122 


FZD7 


frizzled (Drosophila) homolog 7 


2.554 


0.747 


0.286 


328-329 


366100 


MATN2 


matrilin 2 


2.205 


0.536 


0.315 


330-331 


71863 


SLC11A3 


solute carrier family 11 (proton- 
coupled divalent metal ion 
transporters), member 3 


2.466 


0.808 


0.317 


332-333 


767641 




ESTs 


0.965 


-0.153 


0.461 


334-335 


292463 


ERCC5 


excision repair cross- 
complementing rodent repair 
deficiency, complementation 


1.695 


-0.098 


0.289 
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Table 5. Markers that were Over-expressed in Ovarian Cancer in a Comparison 
of Ovarian Epithelial Cancer to Normal Postmenopausal Ovarian Tissue 



SEQ. ID. 
NO. 


IMAGE 
ID 


Nucleic 
Acid 


Description 


Average 

log 
normal 


Average 

log 
cancer 


Cancer 

to 
normal 


18-19 


82195 


SERPINF2 


Branched chain keto acid 
dehydrogenase El, beta 
polypeptide (maple syrup urine 
disease) 




1.45 




30-31 


295939 


FLJ22174 


hypothetical protein FLJ22174 




1.38 




50-51 


755599 


OTTM1 


Interferon induced 
transmembrane protein 1 (9-27) 




1.74 




: 55-57 


624655 


IFITM2 


Interferon-induced protein 1-8D 




1.53 




60 


786675 


HE4 


Epididymis-specific, whey-acidic 
protein type, four-disulfide core; 
putative ovarian carcinoma 
marker 




2.41 




68-69 


782513 


G1P3 


Interferon, alpha-inducible 
protein (clone IFI-6-16) 




1.64 




74-76 


182288 


DDR1 


Receptor protein-tyrosine kinase 
EDDR1 




1.43 




85-86 


811139 


HLA- 
DRB5 


Major histocompatibility 
complex, class II, DR beta 5 




1.91 




101-103 


417711 


HLA- 
DKB1 


Major histocompatibility 
complex, class II, DR beta 1 




1.94 




89-91 


725751 


CD74 


Invariant chain=la-associated 
invariant gamma-chain 




2.69 




92-93 


840681 


CD74 


Invariant chain=la-associated 
invariant gamma-chain 




2.58 




94-96 


117411 


HLA-DRA 


MHC Class II=DR alpha 




1.62 




97-99 


207715 


HLA-DPA 


MHC Class H=DP alpha 




1.85 




122-123 


361323 


RGS1 


regulator of G-protein signaling 1 




1.73 




133-135 


755279 


FOS 


c-fos . 




1.76. 




149-151 


279388 


SORL1 


Mosaic protein LR1 l=hybrid 
receptor gp250 precursor 




1.56 




164-166 


813256 


ABCB1 ; 


MDRl=Muludrug resistance 
protein l=P-glycoprotein 




1.64 




167-168 


23804 


ZFP36 


Zinc finger protein homologous 
to Zfp-36 in mouse 




1.74 




169-171 


135880 


ZFP36 


TTP=tristetraproline=GOS24=zin 
c finger transcriptional regulator 




2.00 




174-175 


41650 


HGF 


hepatocyte growth factor 
(hepapoietin A; scatter factor) 




1.54 




176-178 


840776 


SGK 


sgk=putative serine/threonine 
protein kinase transcriptionally 
modified during anisotonic and 
isotonic alteration 




2.02 




179-180 


814508 


PPP1R7 


Protein phosphatase 1, regulatory 
subunit 7 




1.78 




181-182 


204335 


CD24 


CD24 antigen (small cell lung 
carcinoma cluster 4 antigen) 




2.75 
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190-192 


485770 


BRF2 


Tis 1 ld=ERF-2=growth factor 
early response gene 




1.62 




199-201 


144675 


TLR3 


TLR3- Toll-like receptor 3 




1.98 




340-341 


378813 


SLPI 


secretory leukocyte protease 
inhibitor (antileukoprotcinase) 


-0.379 


2.294 


6.377 


342 


378461 


SPP1 


secreted phosphoprotein 1 
(osteopontin, bone sialoprotein I, 
early T-lymphocyte activation 1) 


-2.657 


-0.088 


5.938 


343-344 


741977 


BF 


B-factor, properdin 


-0.362 


1.953 


4.974 


345-347 


810899 


CKS1 


ckshsl=homolog of 

Cks l=p34Cdc28/Cdc2-associated 

protein 


-1.484 


0.637 


4.351 


348-349 


470393 


MMP7 


matrix raetalloproteinase 7 
(matrilysin, uterine) 


0.673 


2.535 


3.635 


350-351 


742101 


PAX8 


paired box gene 8 


-0.566 


1.196 


3.391 


352-353 


814378 


SPINT2 


serine protease inhibitor, Kunitz 
type, 2 


-0.306 


1.432 


3.336 


354 


451907 


ZWINT 


ZW10 interactor 


-2.461 


-0.856 


3.043 


355 


2544675 


DGKH 


diacylglycerol kinase, eta 


-0.036 


1.498 


2.896 


356 


782811 


HMGIY 


high-mobility group (nonhistone 
chromosomal) protein isoforms I 
andY 


-2.272 


-0.760 


2.851 


357-359 


504763 


SDC4 


Syndecan-4 = amphiglycan = 
ryudocan core protein 


-0.871 


0.575 


2.725 


360 


1161155 


CDKN2A 


cyclin-dependent kinase inhibitor 
2 A (melanoma, pi 6, inhibits 
CDK4) 


-0.839 


0.593 


2.699 


361-362 


810873 


SCNN1A 


sodium channel, nonvoltage- 
gated 1 alpha 


0.127 


1.534 


2.652 


363 


43550 


LDHA 


lactate dehydrogenase A 


-3.496 


-2.152 


2.538 


364-365 


131839 


FOLR1 


folate receptor 1 (adult) 


-0.867 


0.467 


2.522 


366-367 


855749 


TPI1 


Triosephosphate isomerase 1 


-2.272 


-1.008 


2.400 


368 


2514426 


KLK8 


kallikrein 8 (neuropsin/ovasin) 


-0.491 


0.742 


2.352 


200 


79629 


CXCR4 


CXC chemokine receptor 4= 
fusin==neuropeptide Y 
receptor=L3 


-0.588 


0.618 


2.307 


369-370 


825606 


KNSL1 


kdnesin-like 1 


-1.797 


-0.602 


2.290 


371-372 


488964 


H2AFO 


H2A histone family, member O 


-1329 


-0.144. 


2.274 


373-374 


153411 


HLA-DRA 


major histocompatibility 
complex, class 11, DR alpha 


1.967 


3.150 


2.270 


<j / D 




LJtvJUr 1 


cysteine-rich protein 1 (intestinal) 


U.UoO 


1.24o 


2.ZJ4 


J /\j 


QC A7fl.fi 


PP 


pyrophosphatase (inorganic) 




f\ 1 1 Q 

U.l Jo 


*) O 1 A 

2.214 


177 378 




xSo IS 


Unknown 


(\ T\ A 
U.214 




l.L\l 


170 1Q1 


OA*7KQ 

zO/iDo 


or n A i 
&LC2A1 


glucose transporter (HepG2) 


-1.190 


-0.050 


2.204 


! 382 


897770 


ESTs 


Unknown 


-0.173 


0.943 


2.167 


383-385 


813673 


HDGF 


hepatoma-derived growth factor 


-0.786 


0.329 


2.166 


| 386 


882522 


ASS 


argininosuccinate synthetase 


-0.424 


0.676 


2.143 


387-388 


770388 


CLDN4 


claudin4 


0.065 


1.159 


2.135 


389 


897956 


PRAME ] 


preferentially expressed antigen 
in melanoma 


-2.071 


-0.977 


2.134 


390-391 


897788 


ptprf ; 


LAR = LCA-homologue 


-0.175 


0.900 


2.108 


392-393 


741139 


EYA2 < 


syes absent (Drosophila) 
aomolog 2 


-0.133 


0.939 


2.102 


394-396 


138917 


MYCL1 ] 


L-myc 


-0.042 


1.026 


2.096 


397-399 


840691 


STAT1 ! 

i 


3TAT1=IFN alphafteta- 
•esponsive transcription factor 


-0.023 : 


1.044 


2.095 
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ISGF3 beta subunits (p91/p84) 








400-401 


564492 


MTCH2 


mitochondrial carrier homolog 2 


-1.566 


-0.512 


2.076 


402 


435597 


HTR3A 


5-hydroxytryptamine (serotonin) 
receptor 3A 


-0.392 


0.656 


2.067 


403-404 


68950 


CCNE1 


cyclin El 


-0.470 


0.577 


2.066 


405 


739155 


CDH6 


cadherin 6, type 2, K-cadherin 
(fetal kidney) 


-0.286 


0.748 


2.048 


406-408 


531028 


PRKAG1 


5'-AMP-activated protein kinase, 
gamma- 1 subunit 


0.148 


1.181 


2.046 


409 


2403485 


DEFB1 


defensin, beta 1 


0.335 


1.357 


2.031 


410-411 


626502 


ARPC1B 


actin related protein 2/3 complex, 
subunit 1 A (41 kD) 


-0.808 


0.213 


2.030 


412-414 


71622 


PRKCI 


PKC iota=Protein kinase C, iota 


-0.202 


0.802 


2.006 


415 


1610448 


; GAPD 


glyceraldehyde-3-phosphate 
dehydrogenase 


-1.484 


-0.480 


2.005 


416-417 


85497 


C2 


complement component 2 


-0.413 


0.589 


2.002 


418-419 


843075 


H2AFY 


H2A histone family, member Y 


-0.829 


0.164 


1.990 


420-421 


840567 


TM4SF1 


transmembrane 4 supertamily 
member 1 


-1.261 


-0.270 


1.987 


422-423 


50117 


GAPD 


glyceraldehyde-3 -phosphate 
dehydrogenase 


-2.692 


-1.706 


1.981 


424-426 


809910 


IFITM3 


Interferon-inducible protein 1-8U 


-0.099 


0.887 


1.981 


427-428 


248261 


GLDC 


glycine dehydrogenase 
(decarboxylating; glycine 
decarboxylase, glycine cleavage 
system protein P) 


-0.757 


0.221 


1.970 


429-430 


144881 


CALU 


calumenin 


-1.579 


-0.620 


1.943 


431-432 


208764 


HBA2 


hemoglobin, alpha 2 


-0.114 


0.837 


1.934 


433 


810612 


S100A11 


SI 00 calcium-binding protein 
Al 1 (calgizzarin) 


-0.670 


0.279 


1.931 


434-436 


897567 


LDHA 


Lactate dehydrogenase A 


-2.982 


-2.038 


1.925 


437 


769921 


UBE2C 


ubio^tin-conjugating enzyme 
E2C 


-1.429 


-0.487 


1.922 


438-440 


304908 


B2F3 


E2F-3=pRB-binding transcription 
factor=KlAA0075 


-0.526 


0.416 


1.921 


441-442 


251019 


CDH1 


E-cadherin 


-0.248 


0.682 


1.905 


443-444 


210405 


PSME2 


proteasome (prosome, 
macropain) activator subunit 2 
(PA28beta) 


-0.715 


0.213 


1.902 


445-447 


344430 


BMP7 


OP-l=osteogenic protein in the 
TGF-beta family 


-0.075 


0.852 


1.901 


448 


825470 


TOP2A 


TOP2A 






2 
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Table 6. Markers that were Differentially Expressed Between £i?012-Linked 
and Sporadic Tumors in a Comparison to reference Immortalized Ovarian 
Epithelial Cells. 



SEQ 
ID 
NO. 


Nucleic 
Acid 


Description 


Geometric 
mean of 
ratios in 
BRCA1 


Geometric 
mean of 
ratios in 
sporadic 


Fold 
difference 
in 

geometric 
means 


805 


CD72 


B-cell differentiation antigen 
CD72 (human); 


1.49 


1.17 


0.79 


544 


SLC25A11 


solute carrier family 25 
(mitochondrial carrier; 
oxoglutarate carrier), member 1 1 


1.27 


1.08 


0.84 


545- 
547 


LCN2 


lipocalin 2 (oncogene 24p3) 


1.29 


0.98 


0.76 


538- 
540 


PSTPIP1 


interferon-gamma IEF SSP 
5111; Interferon gamma 
unregulated protein 


1.95; 1.6 


1.31; 1.04 


0.67; 0.65 


543 


SIAHBP1 


fuse-binding protein-interacting 
repressor 


1.86 


1.21 


0.65 


533 


UBE1 


ubiqritm-activating enzyme El 


1.54 


0.94 


0.61 


524- 
526 


WAS 


Wiskott-Aldrich syndrome 
protein 


1.13 


0.79 


0.7 


541- 
542 


IDH2 


isocitrate dehydrogenase 2 
(NADP+), mitochondrial 


1.69 


1.02 


0.6 


527- 
528 


PCTK1 


PCTADRE l serine/threonine 

protein kinase j 


1.33 


1.12 


0.84 



Table 7. Markers that were Differentially Expressed Between 2?jRC42-Linked 
and Sporadic Tumors in a Comparison to reference Immortalized Ovarian 
Epithelial Cells. 



SEQ 
ID 
NO. 


Nucleic 
Acid 


Description 


Geometric 
mean of 
ratios in 
BRCA1- 
linked 
tumors 


Geometric 
mean of 
ratios in 
sporadic 
tumors 


Fold 
difference 
in 

geometric 
means 


279 


LOC51760^ 


B/K protein 


1.32 


1.1 


0.83 


280 


LRPAP1 


low density lipoprotein-related 
protein-associated protein 1 
(alpha-2-macroglobulin . 
receptor-associated protein 1) 


1.45 


1.13 


0.78 
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Table 8. Markers that were Differentially Expressed Between Combined BRCA- 
Linked Group and Sporadic Tumors in a Comparison to reference Immortalized 
Ovarian Epithelial Cells. 



SEQ 
ID 
NO. 


Nucleic 
Acid 


Description 


Geometric 

ratios in 
BRCA1- 
finked 
tumors 


Geometric 

ITlcaU 01 

ratios in 
sporadic 
tumors 


Fold 
difference 
in 

geometric 
means 


281 


pstpipi 


interferon-gamma IEF SSP 
511 l=Interferon gamma 
upregulated protein 


1.73; 1.41 


1.31; 1.04 


0.76; 0.74 


282 


IDH2 


Isocitrate dehydrogenase 2 
(NADP+), mitochondrial 


1.66 


1.02 


0.61 


274 


PCTK1 


PCTAIRE 1 serine/threonine 
protein kinase 


1.29 


1.12 


0.86 



Table 9. Markers that were Differentially Expressed between BRCAl-like and 
BRCA2-Uke tumors in a Comparison to reference Immortalized Ovarian 
Epithelial Cells. 



SEQ 
ID 
NO: 


Gene 


Description 


Geometric 

moon nf 

ratios in 
BRCA1 


Geometric 

ItlCdD OI 

ratios in 
BRCA2 


Fold 
jjuierence 
in 

Geometric 
Means 


122- 
123 


RGS1 


regulator of G-protein 
signalling 1 


1.79 


4.75 


2.65 


122- 
123 


RGS1 


BL34=RGSl=regulator of G- 
protein signaling which inhibits 
SDF-1 directed B cell 
migration 


2.09 


5.05 


2.41 


594- 
596 


RGS16 


A28-RGS14p=G protein 
signaling regulator 


1.22 


2.32 


1.9 


612- 
614 


SCYA4 


JVfflM beta=SCAY2=G- 
26=HC21=pAT 744=LAG- 
1-Act-2=H400=SIS- 
gamma=chemokine 


1.29 


2.23 


1.73 


612- 
614 


SCYA4 


MIP-1 beta=SCAY2=G- 
26=HC21-pAT 744=LAG- 
l=Act-2=H400=SIS- 
gamma=chemokine 


1.09 


1.79 


1.64 


515- 
517 


SFRP4 


frizzled related protein frpHE 


1.13 


1.85 


1.63 


594- 
596 


RGS16 


A28-RGS14p-G protein 
signaling regulator 


1.33 


2.11 


1.58 


790- 
791 


APEX 


APEX nuclease 
(multifunctional DNA repair 
enzyme) 


0.66 


1.04 


1.58 


682 


SFRS11 


splicing factor, arginine/serine- 
richll 


0.69 


1.09 


1.57 


507- 
509 


PTEN 


MMACl=PTEN=Tumor 
suppressor gene at 10q23.3 that 
is Mutated in Multiple 
Advanced 

Cancers=Phosphatase and 


1.03 


1.56 


1.51 
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tens in homolog 








774 


PDE6A 


phosphodiesterase 6A, cGMP- 
specific, rod, alpha 


1.23 


1.85 


1.51 


562- 
564 


CD83 


CD83=B-G antigen IgV 
domain homolog=B-cell 
activation piotein=HB 1 5 


1.46 


2.19 


1.5 


592- 
593 


ZFP161 


ZF5=POZ domain zinc finger 
protein 


1.03 


1.49 


1.45 






ESTs 


1.17 


1.69 


1.44 


707- 
708 


FOXOIA 


forkheadboxOlA 
(rhabdomyosarcoma) 


1.38 


1.93 


1.4 


762 


FIJ10701 


hypothetical protein FLJ10701 


1 


1.4 


1.39 


577- 
579 


CD36 


CD36 


1.32 


1.82 


1.38 


797- 
799 


WNT2 


Wnt-2 


0.81 


1.12 


1.38 






Unknown 


0.82 


1.13 


1.38 


779 


TCF4 


transcription factor 4 


1.19 


1.62 


1.36 


615- 
617 


KDR 


Kinase insert domain receptor 
(a type III receptor tyrosine 
kinase) 


0.8 


1.08 


1.35 






ESTs 


0.81 


1.09 


1.35 


534- 
535 


LOC51760 


B/K protein 


0.98 


1.32 


1.35 


797- 
799 


WNT2 


Wnt-2 


0.99 


1.33 


1.34 


683- 
684 


CUGBP1 


CUG triplet repeat, RNA- 
binding protein 1 


0.73 


0.98 


1.33 


709 


CSRP2 


cysteine and glycine-rich 
protein 2 


0.98 


1.31 


1.33 






ESTs, Moderately similar to 
ALU4 HUMAN ALU 
SUBFAMILY SB2 
SEQUENCE 
CONTAMINATION 
WARNING ENTRY 
[H.sapiens] 


0.81 


1.06 


1.32 


606- 
608 


MMP13 


MMP-13=Matrix 
metalloproteinase 13=CL- 
3=Collagenase 3 


0.99 


1.3 


1.32 


580- 
582 


PDGFRB 


Platelet-derived growth factor 
receptor, beta 

polypepnde=fused to TEL in 
t(5;12)(q33;pl3) chronic 
myelomonocytic leukemia 


1.41 


1.85 


1.31 


603- 
605 


ENPP1 


PC-1 = alkaline nucleotide 
pyrophosphatase 


1.01 


1.32 


1.31 






FGFR2=Fibroblast growth 
factor receptor 2 


0.8 


1.05 


1.31 


695- 
696 


SMG1 


PI-3-kinase-related kinase 
SMG-1 


1.07 


1.4 


1.31 


521- 
523 


TALI 


scl=tal-l=T-cell acute 
lymphocytic leukemia 1 


1.14 


1.49 


1.31 


727- 
728 


PLXNA2 


plexin A2 


1.32 


1.71 


1.3 


759 


LOC51605 


CGI-09 protein 


0.8 


1.04 


1.3 


784 


TNRC12 


trinucleotide repeat containing 
12 


1.02 


1.33 


1.3 
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EST 


0.86 


1.12 


1.3 


797- 
799 


WNT2 


wingless-type MMTV 
integration site family member 
2 


0.98 


1.27 


1.29 


693- 
694 


POLR2A 


polymerase (RNA) II (DNA 
directed) polypeptide A 
(220kD) 


0.79 


1.01 


1.29 


737- 
738 


FLJ21661 


hypothetical protein FLJ21661 


0.61 


0.79 


1.28 


780- 
781 


RAB2L 


RAB2, member RAS oncogene 
family-like 


1.05 


1.35 


1.28 


577- 
579 


CD36 


CD36 


1.26 


1.61 


1.28 


568- 
570 


RYBP 


Death effector domain- 
associated factor=Binds to 
Caspase 10 DED 
dornain=Homolog of mouse 
RYBP repressor protein that 
interacts withPolycomb 
complex and YY1=YAF2 
homolog=DEDAF=YAF2 
homolog=MLNewGene3 


0.89 


1.14 


1.28 


571- 
573 


ITGAE 


CD103 alpha=Integrin alpha-E 


1.09 


1.38 


1.27 






Human SH3 doniam-containing 
protein SH3P 1 8 mRNA, 
complete cds 


1.23 


1.56 


1.27 


755- 
756 


TMEPAI 


transmembrane, prostate 
androgen induced RNA 


0.99 


1.25 


1.26 


565- 
567 


NCOA1 


SRC-l=steroid receptor 
coactivator 


1.04 


1.3 


1.25 


/o5 


CSNK1E 


casein kinase 1, epsilon 


0.74 


0.92 


1.25 


/05- 

769 


FU21940 


hypothetical protein FLJ2 1 940 


0.91 


1.14 


1.25 


723- 
724 


SAST 


■syntrophin associated 
serme/threonine kinase 


1.05 


1.32 


1.25 






ESTs 


0.89 


1.11 


1.25 


782- 
783 


GOLGA1 


golgi autoantigen, golgin 
subfamily a, 1 


0.76 


0.95 


1.24 


5/4- 
576 


IL7 


IL-7 


0.99 


1.23 


1.24 


319 


RNAC 


RNA cyclase homolo^ 


0.92 


1.13 


1.24 


676- 
677 


KIAA0365 


KIAA0365 gene product 


0.97 


1.2 


1.23 


679« 
681 


ARHGEF6 


KIAA0006 


1.04 


1.28 


1.23 


710- 
711 


BRE 


brain and reproductive organ- 
expressed (TNFRSF1A 
modulator) 


1.11 


1.35 


1.22 






Unknown 


0.9 


1.11 


1.22 


670- 

0/1 


PfYKTi 
rUJNl 


paraoxonase 1 


0.93 


1.14 


1.22 






ESTs, Weakly similar to 
uojoo reverse trails cnpiase 
homolog |"H.sapiensl 


1.11 


1.36 


1.22 • 






ESTs 


0.72 


0.87 


1.21 




ATP7A 


ATPase, Cu-H- transporting, 
alpha polypeptide (Menkes 


0.99 


1.2 


1.21 
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syndrome) 












T T__ 1 

Unknown 


1.2 


L45 


1.21 


/ jj" 

736 


CRB1 


crumbs (Drosophila) homolog 
1 


0.88 


1.06 


1.21 


7S7 

758 


ZNF211 


zinc finger protein 211 


0.82 


0.99 


1.21 






COLS 


0.99 


1.16 


1.18 


685- 
686 




gamma-aminobutyric acid 
(uAJBA) A receptor, pi 


0.91 


1.07 


1.17 


687- 
688 


BMP6 


bone morphogenetic protein 6 


0.95 


LI 


1.16 


587- 
588 


BAD 


T*t A TV 1 .1 .. . /T i j* « a 

JdAU— DDCo=proapoptotic Bcl-2 
homolog 


1.11 


0.94 


0.85 


678 




nicastrin 


1.13 


0.94 


0.83 


766- 
767 

/Of 




hypothetical protein F23 149 1 


1.11 


0.91 


0.82 


701- 
702 


FLJ12442 


hypothetical protein FIJI 2442 


1.04 


0.85 


0.82 


joy- 
591 


ZNP173 


acid finger protein 


1.16 


0.95 


0.81 


7A1 

742 


GCAT 


glycine C-acetyltransferase (2- 
anmio-3-ketobutyrate 
coenzyme A Iigase) 


1.12 


0.91 


0.81 


786 
/OO- 

787 


AFP 


alpha-fetoprotein 


1.2 


0.96 


0.8 






nrAKo5=SER/THR-protein 
kinase PAK-gamma =P21- 
activated kinase 3 


1.05 


0.84 


0.8 


747- 
748 


LOX 


lysyl oxidase 


0.93 


0.75 


0.8 


662- 

003 


UADO 

JtiAKo 


nistidyl-tRNA synthetase 


0.73 


0.57 


0.79 . 


544 


<2T A 1 1 


solute carrier family 25 
(mitochondrial earner; 
oxoglutarate carrier), member 

Li 


1.27 


1 


0.78 


697- 

AOS 


\Jlrtirl 


GP- Imputative G-protein 


0.97 


0.76 


0.78 


700 


GS2NA 


nuclear autoantigen 


0.96 


0.75 


0'.'78 ' 


/ID- 
706 


KIAA0144 


KIAA0144 gene product 


0.87 


0.68 


0.78 j 






Unknown 


1.02 


0.79 


0.78 


733- 
734 


PPY2 


paucrcauc poiypepnae J. 


1 AC\ 

1.49 


1.16 


0.78 


653- 
654 


NAGA 


N-acetylgalactosaminidase, 
alpha- 


1.06 


0.82 


0.78 


583- 

JO*f 


TT 171? 


IL-17 receptor 


1.05 


0.82 


0.78 


657- 
6S8 




solute carrier family 9 
(sooaum/nyarogen excnanger), 
isofonn 1 (antiporter, Na+/H+, 

cUXUIUilUC oCiLMLLVC / 


o.y / 


0.75 


A T7 

0.77 


691- 
692 


RBBP4 


retinoblastoma-binding protein 
4 


0.87 


0.67 


0.76 


609- 
611 


ILK 


IlJ^^tegrm-linked kinase 


0.88 


0.67 


0.76 


624- 


PTK2B 


protein tyrosine kinase PYK2 ' 


L07 


0.81 


0.76 



33 



WO 03/068054 



53/60 



625 












504- 
506 


AKT1 


BPGF-l=bone-derived growth 
factor=Q6= s quiescin=expression 
is induced by reversible growth 
arrest, trypsinization and serum 
starvation and is blocked by 
SV40 transformation 


0.75 


0.56 


0.75 


763- 
764 


DKFZP564C186 


DKFZP564C186 protein 


0.97 


0.73 


0.75 


554- 
556 


SCYB5 


ENA78=chemokine 


0.34 


0.25 


0.75 


660- 
661 


MNATl 


menage a trois l (CAK 
assembly factor) 


1.24 


0.93 


0.75 


548- 
550 


TCEB2 


Elongin B=RNA polymerase II 
transcription factor SHI p 1 8 
subunit 


0.98 


0.73 


0.74 


792- 
793 


RBBP2 


retinoblastorna-binding protein 
2 


1.47 


1.08 


0.74 


626- 
628 


\jf A "OTTO 

MAP2K3 


Dual specificity mitogen- 
activated protein kinase kinase 
3 


1.06 


0.78 


0.74 


712- 
115 


DAT *\r 

KALY 


RNA-binding protein 
(autoantigenic) 


0.85 


0.62 


0.74 


743- 
744 


P14L 


similar to Bos taurus Pl4 
protein 


0.87 


0.64 


0.73 


731- 
732 


APMCFl 


APMCFl protein 


' 0.95 


0.7 


0.73 


674- 
675 


GNB2 


guanine nucleotide binding 
protein (G protein), beta 
polypeptide 2 


0.99 


0.73 


0.73 






Lymphotoxin-Beta receptor 
precursor=Tumor necrosis 
factor receptor 2 related 
protein=Tumor necrosis factor 
C receptor 


1.28 


0.93 


0.73 






ESTs 


1.02 


0.73 


0.72 


666- 
667 


MPI 


mannose phosphate isomerase 


1.22 


0.87 


0.71 


719- 
720 


PEF 


PEF protein with a long N- 
terminal hydrophobic domain 
(peflin) 


1 


0.71 


• 

0.71 


651- 
652 


FDFTl 


farnesyl-diphosphate 
famesyltransferase l 


1.09 


0.77 


0.71 


/jy- 
740 


RAB3A 


KAbJA, member KAo 
oncogene family 


0.8 


0.57 


0.71 






EST 


1.49 


1.05 


0.71 


621- 
623 


PPP2R5A 


phosphatase 2A B56-alpha 
(PP2A) 


0.89 


0.63 


0.71 


600- 
602 


GART 


Phosphoribosylglycinamide 
forrnyltransferase, 
phosphoribosylglycinamide 
synthetase, 

phosphoribosylarninoimidazole 
synthetase 


0.82 


0.58 


0.7 


551- 
553 


NM23H1 


nm23-Hl=NDP kinase 
A=Nucleoside dephophate 
kinase A 


0.71 


0.49 


0.7 


655- 
656 


SECRET 


secretagogin 


0.84 


0.58 


0.69 
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Unknown 


0.8 


0.55 


0.69 






EST 


1 1.19 


0.82 


0.69 


HO- 
ll 1 


FLJ22059 


hypothetical protein FLJ22059 


0.7 


0.49 


0.69 


659 


TUFM 


Tu translation elongation 
factor, mitochondrial 


! 1.16 


0.8 


0.69 


518- 
520 


RUNX1 


core binding factor alphalb 
subunit=CBF 
alphal=PEBP2aAl 
transcription factor =AML1 
Proto-oncogene=translocated in 
acute myeloid leukemia 


0.69 


0.47 


0.69 


585- 
586 


HGF 


Hepatocyte growth factor 
(hepapoietin A; scatter factor) 


0.88 


0.6 


0.68 






EST, Weakly similar to 
PRPP HUMAN SALIVARY 
PROLINE-RICH PROTEIN II- 
1 [H.sapiens] 


0.98 


0.67 


0.68 






Unknown 


1.02 


0.7 


0.68 






Human mRNA for unknown 
product, partial cds 


0.57 


0.39 


0.68 


618- 
620 


IL18R1 


IL-18 receptor l=IL-lRrp 


0.88 


0.6 


0.68 


510- 
512 


UBL1 


ubiquitin-homology domain 
protein PIC 1 


1.49 


1.01 


0.68 


703- 
704 


KIAA0218 


KIAA0218 gene product 


1.2 


0.81 


0.68 


760- 
761 


MAPRE1 


microtubule-associated protein, 
RP/EB family, member 1 


0.6 


0.4 


0.67 


777 


SF3B4 


splicing factor 3b, subunit 4, 
49kD 


1.17 


0.78 


0.67 






Unknown 


1.04 


0.7 


0.67 






ESTs 


0.42 


0.28 


! 0.66 


533 


UBE1 


ubiquitin-activanng enzyme El 
(A1S9T and BN75 temperature 
sensitivity complementing) 


1.54 


1.01 


0.66 






Unknown 


1.23 


0.81 


0.66 


788- 
789 


COVA1 


cytosolic ovarian carcinoma i 
antigen 1 


0.57 


0.37 


0.64 






Unknown 


0.82 


0.53 


0!64 


668- 
669 


TAGLN2 


transgelin 2 


0.7 


0.44 


0.64 


749- 1 
750 


PISD 


phosphatidylserine 
decarboxylase 


0.8 


0.5 


0.62 


775- 
776 


IL1B 


interleukin 1, beta 


0.3 


0.17 


0.58 


429- 
430 


CALU 


calumenin 


0.7 


0.39 


0.56 


597- 
599 


PPP1CB 


PPPlCB=Protein phosphatase 
1, catalytic subunit, beta 
isoform 


0.78 


0.44 


0.56 


751- 
752 




ESTs, Weakly similar to A 
Chain A, Cyclophilin A 
[H.sapiens] 


0.78 


0.43 


0.55 


745- 
746 


KIAA0008 1 


KIAA0008 gene product 


0.49 


0.27 


0.55 






Unknown 


2 


1.09 


0.54 


664- 


EIF4A1 


eukaryotic translation initiation 


0.7 


0.37 


0.53 
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s" /- r 

665 




factor 4A, isoform 1 








559- 
561 


S100A4 


SI 00 calcium binding protein 
A4=Placental calcium binding 
orotein^Cal va <! r 1 1 1in=mtc 1 

^/IWlwiU VOiVag^UUU ililol 

PROTEIN=CAPL 


2.28 


1.21 


0.53 


375 


PPIA 


peptidylprolyl isomerase A 
(cyclophilin A) 


0.76 


0.39 


0.52 






EST 


2.56 


1.26 


0.49 



! 
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Table 10. Markers that can be used to Classify BRCAl-hke from BMCA2-like 
Tumor Types using Compound Covariate Prediction Analysis. 



SEQ 
ID 
NO: 


Gene 


JL/CavA ijp UU11 


t- 
Yalue 


Log ratios 

& 

BRCA2- 

like 
sporadics* 


Midpoint of 
average log- 
ratios in 
each class 


Average log ratios 
in 

BRCA1&BRCA1- 
like sporadic 


659 


TUFM 


Tu translation elongation 
factor, mitochondrial 


-10 


-0.09854 


-0.016 


0.067443 


749- 
750 


PISD 


phosphatidylserine 
decarboxylase 


8.2305 


-0.28567 


-0.187 


-0.08778 


745- 
746 


KIAA0008 


KIAA0008 gene product 


8.0421 


-0.56864 


-0.431 


-0.29414 


703- 
704 


KIAA0218 


KIAA0218 gene product 


7.9288 


-0.08197 


-0.005 


0.071882 


751- 
752 


EST 


ESTs, Weakly similar to A 
Chain A, Cyclophiiin A 
[H.sapiens] 


7.6225 


-0.34775 


-0.225 


-0.10292 


621- 
623 


PPP2R5A 


.phosphatase 2 A B56-alpha 
(PP2A) 


-7.469 


-0.20343 


-0.121 


-0.03763 


733- 
734 


PPY2 


pancreatic polypeptide 2 


7.3866 


0.06558 


0.113 


0.160168 


649 


EST 


Unknown 


-7.384 


-0.27327 


-0.183 


-0.09313 


641 


EST 


EST, Weakly similar to 
PRPP HUMAN 
SALIVARY PROLINE- 
RICH PROTEIN II-l 
[H.sapiens] 


7.3561 


-0.17457 


-0.095 


-0.01592 


375 


PPIA 


peptidylprolyl isomerase A 
(cyclophiiin A) 


6.9946 


-0.38934 


-0.258 


-0.12668 


770- 
771 


FLJ22059 


hypothetical protein 
FLJ22059 


6.9726 


-0.31605 


-0.228 


-0.14026 • 


739 


BAB3A 


RAB3A, member RAS 
oncogene family 


6.9458 


-0.23582 


-0.167 


-0.098 


655- 
656 


SECRET 


secretagogin 


6.9307 


-0.23657 


-0.147 


-0.0575 


629- 
631 


TNFR2RP 


Lymphotoxin-Beta receptor 
precursor=Tumor necrosis 
factor receptor 2 related 
protein=Tumor necrosis 
factor C receptor 


6.9268 


-0.02733 


0.038 


0.103462 


551- 
553 


NM23H1 


nm23-Hl=NDP kinase 
A=Nucleoside dephophate 
kinase A 


6.8307 


-0.32239 


-0.242 


-0.16241 


557- 
558 


PAK2 


hPAK65=SER/THR-protein 
kinase PAK-gamma =P21- 
activated kinase 3 


6.7214 


-0.1152 


-0.05 


0.01536 


806 


APRT 


adenine 

phosphoribosyltransferase 


6.6725 


0.044932 


0.085 


0.125156 


807- 
808 


PPP1R15A 


protein phosphatase 1, 
regulatory (inhibitor) 
sub-unit 15A 


-6.648 


-0.21681 


-0.144 


-0.0716 


544 


SLC25A11 


solute carrier family 25 
(mitochondrial carrier, 
oxoglutarate carrier), 
member 11 


6.6083 


0.003461 


0.047 


0.089905 
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719- 
720 


PEF 


PEF protein with a long N- 
terminal hydrophobic 
domain (peflin) 


6.6034 


-0.16368 


-0.088 


-0.01144 


747- 
748 


LOX 


lysyl oxidase 


6.4441 


-0.12784 


-0.077 


-0.02641 


775- 
776 


IL1B 


interleukin 1, beta 


6.4272 


-0.75203 


-0.637 


-0.52288 


809- 
810 


MPHOSPH10 


M-phase phosphoprotein 10 
(U3 small nucleolar 
ribonucleoprotein) 


-6.425 


-0.10347 


-0.048 


0.007748 


653- 
654 


NAGA 


N-acetylgalactosaminidase, 
alpha- 


-6.42 


-0.09205 


-0.034 


0.024896 


760- 
761 


MAPRE1 


microtubule-associated 
protein, RP/EB family, 
member 1 


-6.39 


-0.39254 


-0.296 


-0.19928 


811- 
812 


ARD1 


N-acetyltransferase, 
homolog of S. cerevisiae 
ARD1 


_ 

6.3833 


-0.1707 


-0.11 


-0.04915 


813- 

O 1 A 

814 


CDC6 


CDC6 (cell division cycle 6, 
S. cerevisiae) homolog 


-6.371 


-0.25964 


-0.201 


-0.14327 


643 


EST 


EST 


6.3541 


0.133858 


0.253 


0.371068 


583- 
584 


IL17R 


IL-17 receptor 


6.3499 


-0.08991 


-0.035 


0.019532 


803 


WNT5B 


wingless-type MMTV 
integration site family, 
member 5B 


6.3391 


-0.06803 


-0.017 


0.035029 


651- 
652 


FDFT1 


farnesyl-diphosphate 
farnesyltransferase 1 


6.3387 


-0.1152 


-0.038 


0.039414 


664- 
665 


EEF4A1 


eukaryotic translation 
initiation factor 4A, isoform 
1 


6.2705 


-0.39362 


-0.263 


-0.13253 


650 


EST 


Unknown 


6.2573 


-0.08355 


-0.002 


0.079181 


657- 
658 


SLC9A1 


solute carrier family 9 
(sodium/hydrogen 
exchanger), isoform 1 
(antiporter, Na+/H+, 
amiloride sensitive) 


— 

6.2571 


-0.13608 


-0.077 


-0.01682 


731- 
732 


APMCF1 


APMCF1 protein 


6.2387 


-0.15677 


-0.083 


-0.00922 


503 


ZNF220 


zinc ringer protein 220 


6.2316 


-0.13549 


-0.072 


-0.00922 


815- 
816 


PTPRCAP 


LPAP=lymphoid-restricted 
phosphoprotein=CD45 
phosphatase binding protein 
and putative substrate 


-6.229 


-0.10182 


-0.059 


-0.01592 


817 


POLD3 


polymerase (DNA directed), 
delta 3 


-6.223 


-0.29843 


-0.221 


-0.14327 


788- 
789 


COVA1 


cytosolic ovarian carcinoma 
antigen 1 


6.1802 


-0.42946 


-0.321 


-0.21325 


701- 
702 


FLJ12442 


hypothetical protein 
FLJ12442 


6.1607 


-0.07676 


-0.027 


0.023252 


818 


POT 

EST 


unknown 


- 

6.1033 


-0.12494 


-0.057 


0.011993 


721- 
722 


EST 


Human mRNA for 
unknown product, partial 
cds 


6.1032 


-0.40671 


-0.328 


-0.24949 


662- 


HARS 


histidyl-tRNA synthetase 




-0.24642 


-0.186 


-0.12552 
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.663 






6.0889 








819 


FLJ20746 


putative cyclin Gl 
interacting protein 


6.0827 


-0.04144 


0.017 


0.074451 


820- 
821 


PLK 


pjLfjs.— nomoiogue oi 
Drosophila polo 
ocruiw injcoDinc Kinase 


6.0725 


-0.32422 


-0.253 


-0.18177 


784 


TNRC12 


trinucleotide repeat 
containing iz 


6.0744 


0.110253 


0.062 


0.01368 


723- 
724 


SAST 


syntrophin associated 


£. AAA1 

6.0991 


0.107549 


0.065 


0.022016 


634- 
635 


EST 


ESTs 


£11 TO 

O.J 1 /o 


A A/l orin 

0.048053 


-0.002 


-0.05306 


695- 
696 


SMG1 


PI-3-kinase-related kinase 
SMG-1 


A 1 1 1 0 


U. 14643 8 


0.087 


0.028571 


534- 
535 


LOC51760 




O.lJUo 


0.09691 


0.048 


-0.00174 


574- 
576 


IL7 


IL-7 


£ 1 ceo 


0.088845 


0.043 


-0.00305 


670- 
671 


P0N1 


paraoxonase 1 


6.3314 


0.058046 


0.014 


-0.03105 


592- 
593 


ZFP161 


ZF5=POZ domain zinc 
finger protein 


6.3356 


0.196176 


0.101 


0.006466 


632- 
633 


EST 


ESTs, Moderately similar to 
ALU4 J3UMAN ALU 

QT FDt? A \JtTT V ODI 

SEOUENCE 
CONTAMINATION 
WARNING ENTRY 
[H.sapiens] 


6.3426 


0.028978 


-0.029 


-0.08725 


765 






0.4JoJ 


A AO f'rl'l 

0.026533 


-0,03 


-0.08619 


822 


CCR6 


CCR6=STRL22=chemokine 
icL/cpiur ior lviLro 
alpha/LARG/Exodus on 
activated B cells 


6.4897 


0.116276 


0.069 


0.022016 


679- 
681 


ARHGEF6 


KIAA0006 


6.5098 


0 105169 


V/.V./UJ 




759 


LOC51605 


CGI-09 protein 


6.662 


0.0086 


-0 042 




762 


FLJ10701 


hypothetical protein 
FIJI 0701 


6.925 


0.135133 


0.075 


0.0141 


636 


EST 


Unknown 


7.3197 


0.050766 


0.005 


-0.04144 


647 


EST 


EST 


7.5484 


0.045714 


-0.012 


-0.07007 


725- 
726 


EST 


Unknown 


8.058 


0.068557 


-0.006 


-0.07988 



Table 11. Results of Compound Covariate Predictor Analysis. 



Expld 


Pre-specifled class label 


Correctly classified 


B2-1 vs OSEB2-1 vs OSE 
21083 




YES i 


B2-10 vs OSE B2-10 vs OSE 
21085 




YES 


B2-16 vs OSE B2-16 vs OSE 
21180 




YES 


B2-2 vs OSE B2-2 vs OSE 
21090 




YES 


B2-20 vs OSE B2-20 vs OSE 




YES 



39 



WO 03/068054 


59/60 


PCT/US03/04688 


91 181 

ZA lOi 






RO-91 \ra OCR DO 91 -we HCC 
13 Z-Zx VS Von* rJZ-Zl VS UoJd 






01 1 CO 




Yxlo 


R9-99 vc nCP R9 00 vc 

DZ.-Z.il. VS VJOJJ. I5Z-ZZ VS UoC 







21183 


- 


VT7Q 


R9-73 vc 09F R9 93 vc n<3F 







21091 




ICO 


R9-94 vc R9.94 vc H<3F 







21092 


- 




R9-9S vc O^F R9 9S vc 
JO*"iJ Vo won XpZ-Zj VS UoC 







22038 


— — 


I Co 


R9J3 vc OCP R0 3 \/o HQ!? 
UZ-O VS Uoii XJZ-J VS UoH 






21093 




ICO 


B2-4 vs OSE B2-4 vc OSF 


. 




21094 






B2-5 vs OSE B2-5 vc OSF 






21095 






R9.7 vc OSF R9-7 vc OCR 


— — — — 




21096 




ICO 


B2-R v<? OSF B9-R vc OSF 







21097 


- 




B2-9 vs OSE B2-9 vq OSE 






21098 


- 


I Co 


pi on vc osf nnn vc ocr 

vlvv VS V/OJD VylUu VS \JOC* 


— 




21167 




If xSo 


CI 02 vs OSE C102 vc OSE 







21168 




VFQ 


CI 03 vs OSE CI 03 vc OSF 

vlvJ VO \SlDX-i \sX\JJ VO Vulj 







21169 


- 


VF<5 


C105 vs OSE C105 vs OSE 

VlUJ TO \/UJL( VslV/v/ VO v_/ O J—/ 






21178 




YFS 


C107vs OSE C107vs OSE 






21099 




YFS 


CI 1 0 vs OSE CI 1 0 vc OSF 






21101 




I DO 


1 CI 1 1 vc OSE fill vc H^F 

OX X 1 VO vO£i (will Vo V/OJ-» 






21 109 






C1 1 7 vc fiQF P1 1 7 vc OQF 
OX A / Vo WOL Oil / VS UOD 







21105 


- 


VF9 


C1 1 8 vc OSE C1 1 8 vc OSF 






21106 




YFS 


C123 vs OSE C123 vs OSE 






21107 




YES 


C46 vs OSE C46 vs OSE 19741 


i 


NO 


C77 vs OSE C77 vs OSE 21 108 




YFS 


C84 vs OSE C84 vs OSE 21368 




YES 


C85 vs OSE C85 vs OSE 21 179 




YES 


C99 vs OSE C99 vs OSE 21370 




YES 


B36 vs OSE B36 vs OSE 19680 


2 


YES 


B39 vs OSE B39 vs OSE 19682 


2 


YES 


B40 vs OSE B40 vs OSE 19683 


2 


YES 


B41 vs OSE B41 vs OSE 19684 


2 


YES 


B52-2 vs OSE B52-2 vs OSE 






19771 


2 


NO 


B54 vs OSE B54 vs OSE 19687 


2 


YES 


B55 vs OSE B55 vs OSE 19688 


2 


YES 


B60 vs OSE B60 vs OSE 19690 


2 


YES 


B61 vs OSE B61vs OSE 19695 


2 


YES 


B62 vs OSE B62 vs OSE 19701 


2 


YES 



40 



WO 03/068054 


60/60 


PCT/US03/04688 


B63 vs OSE B63 vs OSE 19706 


2 


I no 


B64 vs OSE B64 vs OSE 19713 


1 2 


VRQ 


B70 vs OSE B70 vs OSE 19722 


2 


I YES 


B74 vs OSE B74 vs OSE 19727 


2 


YES 


B77 vs OSE B77 vs OSE 19731 


2 


YES 


B78 vs OSE B78 vs OSE 21 103 


2 


YES 


B79 vs OSE B79 vs OSE 19743 


2 


YES 


B80 vs OSE B80 vs OSE 21088 


2 


YF9 


C114vs OSE CI 14 vs OSE 
21104 


2 


YES 


C15 vs OSE C15 vs OSE 19734 


2 


YES 


C16 vs OSE C16 vs OSE 19735 


2 


YF^ 


C17 vs OSE C17 vs OSE 19736 


2 


YF9 


Civs OSE Civs OSE 19732 


2 


YES 


C20 vs OSE C20 vs OSE 19737 


2 


YES 


C41 vs OSE C41 vs OSE 19739 


2 


YES 


C42 vs OSE C42 vs OSE 19740 


2 


YES 


C49 vs OSE C49 vs OSE 19742 


2 


YES 


C79 vs OSE C79 vs OSE 21367 


2 


YFS 


C87 vs OSE C87 vs OSE 19744 


2 


YES 


C95 vs OSE C95 vs OSE 21369 


2 


YES 




Overall Success 


91.80% 
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